( 



(12) INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(19) World Intellectual Property Organization 

International Bureau 

(43) International Publication Date 
10 May 2007 (10.05,2007) 





PCT 



(10) International Publication Number 

WO 2007/053702 A2 



(51) International Patent Classification: 
C07H 19/067 (2006.01) 

(21) International Application Number: 

PCT/US2006/042698 

(22) International Filing Date: 31 October 2006 (31.10.2006) 



(25) Filing Language: 

(26) Publication Language: 



English 
English 



(30) Priority Data: 

60/732,373 



31 October2005 (31.10.2005) US 



(71) Applicant (for all designated States except US): THE 
TRUSTEES OF COLUMBIA UNIVERSITY IN 
THE CITY OF NEW YORK [US/US]; WEST 116TH 
STREET AND BROADWAY, New York, NY 10027 (US). 

(72) Inventors; and 

(75) Inventors/Applicants (for US only): JU, Jingyue 
[US/US]; 267 MARIETTA ST., Englewood Cliffs, 
NJ 07632 (US). TURRO, Nicholas, J. [US/US]; 125 
DOWNEY DR., Tenafly, NJ 07670 (US). MENG, Qin- 
gling [CN/US]; 529 WEST 111TH ST. APT. 22A, New 
York, NY 10027 (US). KIM, Dae, H. [US/US]; 1000 
MORNINGSIDE DR., APT. 5H, New York, NY 10027 
(US). BI, Lanrong [CN/US]; 610 WEST 1 15TH ST. APT. 
63A, New York, NY 10025 (US). 



(74) Agent: WHITE, John, P.; COOPER & DUNHAM LLP, 
1185 AVENUE OF THE AMERICAS, New York, NY 
10036 (US). 

(81) Designated States (unless otherwise indicated, for every 
kind of national protection available): AE, AG, AL, AM, 
AT, AU, AZ, B A, BB, BG, BR, BW, BY, BZ, CA, CH, CN, 
CO, CR, CU, CZ, DE, DK, DM, DZ, EC, EE, EG, ES, FI, 
GB, GD, GE, GH, GM, GT, HN, HR, HU, ED, IL, IN, IS, 
JP, KE, KG, KM, KN, KP, KR, KZ, LA, LC, LK, LR, LS, 
LT, LU, LV, LY, MA, MD, MG, MK, MN, MW, MX, MY, 
MZ, NA, NG, NI, NO, NZ, OM, PG, PH, PL, PT, RO, RS, 
RU, SC, SD, SE, SG, SK, SL, SM, SV, SY, TJ, TM, TN, 
TR, TT, TZ, UA, UG, US, UZ, VC, VN, ZA, ZM, ZW. 

(84) Designated States ( unless otherwise indicated, for every 
kind of regional protection available): ARJPO (BW, GH, 
GM, KE, LS, MW, MZ, NA, SD, SL, SZ, TZ, UG, ZM, 
ZW), Eurasian (AM, AZ, BY, KG, KZ, MD, RU, TJ, TM), 
European (AT, BE, BG, CH, CY, CZ, DE, DK, EE, ES, FI, 
FR, GB, GR, HU, IE, IS, IT, LT, LU, LV, MC, NL, PL, PT, 
RO, SE, SI, SK, TR), OAPI (BF, BJ, CF, CG, CI, CM, GA, 
GN, GQ, GW, ML, MR, NE, SN, TD, TG). 

Published: 

— without international search report and to be republished 
upon receipt of that report 

For two-letter codes and other abbreviations, refer to the "Guid- 
ance Notes on Codes and Abbreviations" appearing at the begin- 
ning of each regular issue of the PCT Gazette. 



IT) 



Applicants: Jingyue Ju 
Serial No.: 10/591,520 
Filed: March 3, 2005 
Exhibit 2 



(54) Title: SYNTHESIS OF FOUR COLOR 3'-0-ALLYL MODIFIED PHOTOCLEAVABLE FLUORESCENT NUCLEOTIDES 

fsj AND RELATED METHODS 



o 

(57) Abstract: This invention provides a process for making 3*-0- allyl-dGTP-PC-Biodopv-FL-510, 3' -0-allvl-dATP-PC-ROX. 
3'-O-allyl-dCTP-PC-Bodipy-650 and 3* -O-allyl-dUTP-PC- R6G, and related methods. 



I 

WO 2007/053702 



1 



f 

PCTYUS2006/042698 



SYNTHESIS OF FOUR-COLOR 3 ' -O-AIiLYL MODIFIED 
PHOTOCLEAVABLE FLUORESCENT NUCLEOTIDES 
5 AND RELATED METHODS 

The invention disclosed herein was made with Government 
support under Center of Excellence in Genomic Science 
grant number IP50 HG002806-01 from the National 
10 Institutes of Health, U.S. Department of Health and 
Human Services. Accordingly, the U.S. Government has 
certain rights in this invention. 

Throughout this application, various publications are 
15 referenced in parentheses by number. Full citations for 
these references may be found at the end of each 
experimental section. The disclosures of these 
publications in their entireties are hereby 
incorporated by reference into this application to more 
2 0 fully describe the state of the art to which this 
invention pertains . 

Background of the Invention 

* 

2 5 DNA n sequencing is a fundamental tool for biological 
research and medical diagnostics, driving disease gene 
discovery and gene function studies. DNA sequencing by 
synthesis (SBS) using reversible fluorescent nucleotide 
terminatorsl is a potentially efficient approach to 

30 address the limitations of current DNA sequencing 
techniques, such as throughput and data accuracy. A 3'- 
O-allyl photocleavable (PC) fluorescent nucleotide 
analogue, 3 ' -O-allyl-dUTP-PC-Bodipy-FL-510, as a 
reversible terminator for SBS has previously been 

35 reported (2) . The nucleotide can be efficiently 
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incorporated by DNA polymerase into a growing DNA 
strand to terminate the polymerase reaction. After that 
the fluorophore can be photocleaved quantitatively by 
irradiation at 355 nm, and the allyl group is rapidly 
5 and efficiently removed by using a Pd-catalyzed 
reaction in water to regenerate a free 3' -OH group to 
reinitiate the polymerase reaction. 
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Summary 

This invention provides a method for making 3'0-allyl- 
5 dGTP-PC-Bodipy-FL-510 comprising performing the steps 
set forth in Figure 7 . This invention also provides a 
method for making 3' -O-allyl-dATP-PC-ROX comprising 
performing the steps set forth in Figure 8 . This 
invention also provides a method for making 3'-0-allyl- 
10 dCTP-PC-Bodipy-650 comprising performing the steps set 
forth in Figure 9. This invention also provides a 
method for making 3' -0-allyl-dUTP-PC-R6G comprising 
performing the steps set forth in Figure 10. 

15 This invention also provides a method for making method 
for determining the sequence of a DNA comprising 
performing the following steps for each residue of the 
DNA to be sequenced: 

(a) contacting the DNA with a DNA polymerase in 

20 the presence of (i) a primer and (ii) four 

fluorescent nucleotide analogues under 

conditions permitting the DNA polymerase to 
catalyze DNA synthesis , wherein (1) the 
nucleotide analogues consist of an analogue of 

25 dGTP, an analogue of dCTP, an analogue of dTTP 

or dUTP, and an analogue of dATP, (2) each 
nucleotide analogue comprises (i) a base 
selected from the group consisting of adenine, 
guanine , cytosine, thymine or uracil , and 

30 analogues thereof, (ii) a deoxyribose, (iii) a 

fluorophore photocleavably attached to the 
base, and (iv) an allyl moiety bound to the 3'- 
oxygen of the deoxyribose, so that a nucleotide 
analogue complementary to the residue being 
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sequenced is bound to the DNA by the DNA 
polymerase, and (3) each of the four analogues 
. has a predetermined fluorescence wavelength 
which is different than the fluorescence 
5 wavelengths of the other three analogues; 

(b) removing unbound nucleotide analogues; 

(c) determining the identity of the bound 
nucleotide analogues; and 

(d) following step (c) ,• except with respect to 
10 the final DNA residue to be sequenced, (i) 

chemically cleaving from the bound nucleotide 
analogue the allyl moiety bound to the 3'- 
oxygen atom of the deoxyribose and (ii) 
photocleaving the fluorophore from the bound 
15 nucleotide analogue, wherein steps (d) (i) and 

(d) (ii) can be performed concurrently or in any 
order, and step (d) (i) is performed using a Pd 
catalyst at a pH of about 8.8, 

thereby determining the sequence of the DNA . 

20 

This invention also provides a method for removing an 
allyl moiety from the 3' -oxygen of a nucleotide 
analogue's deoxyribose moiety comprising the step of 
contacting the nucleotide analogue with a Pd catalyst 
2 5 at a pH of about 8.8. 
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Brief Description of the Figures 

Figure 1: Synthesis of a 3'-0-allyl modified 19-mer 
5 oligonucleotide . 

Figure 2: Synthesis of 3' -O-allyl-dUTP-PC-Bodipy-FL- 
510. 

10 Figure 3: Schematic representation (left) and step-by- 
step MALDI-TOF MS results (right) for the deallylation 
of a 3 ' -O-allyl-modif ied oligonucleotide and the use of 
the deallylated oligonucleotide as a primer in a 
polymerase extension reaction. (A) Peak at m/z 5871 

15 corresponding to the HPLC-purif ied 3'-0-allyl modified 
19-mer oligonucleotide. (B) Peak at m/z 5831 
corresponding to the above oligonucleotide without the 
allyl group, obtained after 30 sec of incubation with 
Na 2 PdCl 4 and TPPTS [P (PhS0 3 Na) 3 ] at 7 0 °C. (C) Peak at 

2 0 m/z 6535 corresponding to the extension of the 
deallylated oligonucleotide by Biotin-ll-ddGTP using 
Thermo Sequenase DNA polymerase. 

Figure 4: One entire polymerase reaction cycle using 
25 3' -O-allyl-dUTP-PC-Bodipy-FL-510 as a reversible 
terminator. 

Figure 5: MALDI-TOF MS results for each step of a 
polymerase reaction cycle using 3' -O-allyl-dUTP-PC- 
30 Bodipy-FL-510 as a reversible terminator. (A) Peak at 
m/z 67 87 corresponding to the primer extension product 
11 obtained using 3' -O-allyl-dUTP-PC-Bodipy-FL-510 and 

the 9°N Polymerase (exo-) A485L/Y409V. (B) Peak at m/z 
62 92 corresponding to the photocleavage product 12. (C) 
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Peak at m/z 6252 corresponding to the photocleavage 
product without the allyl group 13 obtained after 90 
sees of incubation with the catalyst and ligand at 7 0 
°C. (D) Peak at m/z 7133 corresponding to the extension 
5 product 14 from the purified deallylated product using 
dGTP-PO-Bodipy-FL-510 and Thermo Sequenase DNA 
Polymerase- (E) Peak at m/z 6637 corresponding to the 
photocleavage product 15. 

10 Figure 6: Structures of four-color 3'-0-allyl modified 
photocleavable fluorescent nucleotides. 

Figure 7: Synthesis of 3' -O-allyl-dGTP-PC-Bodipy-FL-510 
10 . 

15 

Figure 8. Synthesis of 3' -O-allyl-dATP-PC-ROX 19. 



Figure 9: Synthesis of 3' -O-allyl-dCTP-PC-Bodipy-650 
26 . 

20 

Figure 10. Synthesis of 3 ' -0-allyl-dUTP-PC-R6G 33. 

Figure 11. Polymerase DNA extension reaction using 3'- 
O-allyl-dGTP-POBodipy-FL-510 10 as a reversible 
2 5 terminator . 

Figure 12 . A continuous polymerase extension using 10 
as a reversible terminator (left) and MALDI-TOF MS 
spectra of consecutive extension photocleavage and 
30 deallylation products (right) . 

Figure 13. (Left) Scheme showing 3' -O-allyl-dATP-PC- 
ROX 19 as a base specific reversible terminator for DNA 
primer extension, photocleavage and deallylation; 
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(right) MALDI-TOF MS spectra for incorporation, 
photocleavage and deallylation products . 

Figure 14. (Left) Scheme showing 3' -O-allyl-dCTP-PC- 
5 Bodipy-650 26 as a base specific reversible terminator 
for DNA primer extension, photocleavage and 
deallylation; (right) MALDI-TOF MS spectra for 
incorporation, photocleavage and deallylation products. 

10 Figure 15. (Left) Scheme showing 3' -O-allyl-dUTP-PC- 
R6G 33 as a base specific reversible terminator for DNA 
primer extension, photocleavage and deallylation; 
(right) MALDI-TOF MS spectra for incorporation, 
photocleavage and deallylation products . 

15 

Figure 16: Structures of dGTP-PC-Bodipy-FL-510 (X ahs (niax) 
- 502 nm; X em (max) = 510 nm) , dUTP-PC-R6G (X ah3 (max) = 525 
nm; X em (max) = 550 nm) , dATP-PC-ROX (X«b a (max) = 575 nm; 
A-em (max) = 602 nm) , and dCTP-PC-Bodipy-650 (X^a (max) = 630 
2 0 nm; A, em (max) = 650 nm) . 



Figure 17: Synthesis of photocleavable fluorescent 
nucleotides. (a) acetonitrile or DMF/1 M NaHC0 3 
solution; (b) N, N' -disuccinimidyl carbonate (DSC), 
2 5 triethylamine; (c) 0.1 M Na 2 C0 3 /NaHC0 3 aqueous buffer 
(pH 8.5-8.7) . 



Figure 18: DNA extension reaction performed in solution 
phase to characterize the 4 different photocleavable 
30 fluorescent nucleotide analogues (dUTP-PC-R6G, dGTP-PC- 
Bodipy-FL-510, dATP-PC-ROX, dCTP-POBodipy-650 ) . After 
each extension reaction, the DNA extension product is 
purified by HPLC for MALDI-TOF MS measurement, to 
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verify that it is the correct extension product. 
Photolysis is performed to produce a DNA product that 
is used as a primer for the next DNA extension 
reaction. 



\ 



Figure 19: The polymerase extension scheme (left) and 
MALDI-TOF MS spectra of the four consecutive extension 
products and their photocleavage products (right) . 
Primer extended with dUTP-PC-R6G (1) , and its 

10 photocleavage product 2; Product 2 extended with dGTP- 
PC-Bodipy-FL-510 (3) , and its photocleavage product 4; 
Product 4 extended with dATP-PC-ROX (5) , and its 
photocleavage product 6; Product 6 extended with dCTP- 
PC-Bodipy-650 (7) , and its photocleavage product 8. 

15 After 10 seconds of irradiation with a laser at 355 nm, 
photocleavage is complete with all the f ltiorophores 
cleaved from the extended DNA products . 

Figure 20: Immobilization of an azido-labeled PCR 
2 0 product on an alkynyl-f unctionalized surface and a 
ligation reaction between the immobilized single- 
stranded DNA template and a loop primer to form a self- 
priming DNA moiety on the chip. The sequence of the 
loop primer is shown in (A) . 

25 

Figure 21: Schematic representation of SBS on a chip 
using four PC fluorescent nucleotides (Upper panel) and 
the scanned fluorescence images for each step of SBS on 
a chip (Lower panel) . (1) Incorporation of dATP-PC-ROX; 
30 (2) Photocleavage of PC-ROX; (3) Incorporation of dGTP- 
PC-Bodipy-FL-510; (4) Photocleavage of PC-Bodipy-FL- 
510; (5) Incorporation of dATP-PC-ROX; (6) 

Photocleavage of PC-ROX; (7) Incorporation of dCTP-PC- 
Bodipy-650; (8) Photocleavage of PC-Bodipy-650; (9) 
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Incorporation of dUTP-PC-R6G; (10) Photocleavage of pc- 
R6G; (11) Incorporation of dATP-PC-ROX; (12) 
Photocleavage of PC-ROX; (13) Incorporation of dUTP-PC- 
R6G; (14) Photocleavage of PC-R6G; (15) Incorporation 
5 .of dATP-PC-ROX; (16) Photocleavage of PC-ROX; (17) 
Incorporation of dGTP-PC-Bodipy-FL-510; (18) 

Photocleavage of PC-Bodipy-FL-510; (19) Incorporation 
of dUTP-PC-R6G; (20) Photocleavage of PC-R6G; (21) 
Incorporation of dCTP-PC-Bodipy- 650; (22) Photocleavage" 
of PC-Bodipy-650; (23) Incorporation of dATP-PC-ROX; 
(24) Photocleavage of PC-ROX. 
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Figure 22: 4-Color DNA sequencing data using 3'-o- 
allyl-dGTP-PC-Bodipy-FL-510, 3' -O-allyl-dATP-PC-ROX, 

15 3'-0-allyl-dUTP-PC-R6G and 3 ' -O-allyl-dCTP-PC-Bodipy- 
650 as reversible terminators and a 4-color Laser 
Scanner. Scanned fluorescence • images for each step of 
SBS on a DNA chip to sequence a DNA template with 
homopolymeric regions . 

20 
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Detailed Description of the Invention 

Terms 

The following definitions are presented as an aid in 



unaersuanaincf 


tnis 


invention : 


A 




Adenine; 


10 C 




Cytosine; 


DNA 




Deoxyribonucleic acid; 


G 




Guanine; 


PC 




Photocleavable 


RNA 




Ribonucleic acid; 


15 SBS 




Sequencing by synthesis; 


T 




Thymine ; and 


U 




Uracil . 



"Nucleic acid " shall mean any nucleic acid, including, . 

2 0 without limitation, DNA, RNA and hybrids thereof. The 
nucleic acid bases that form nucleic acid molecules can 
be the bases A, C, G, T and U, as well as derivatives 
thereof. Derivatives of these bases are well known in 
the art, and are exemplified in PCR Systems, Reagents 

25 and Consumables (Perkin Elmer Catalogup 1996 1997, 

Roche Molecular Systems, Inc., Branchburg, New Jersey, 
USA) . 

As used herein, M self-priming moiety'' shall mean a 
30 nucleic acid moiety covalently bound to a nucleic acid 
to be transcribed, wherein the bound nucleic acid 
moiety, through its proximity ' with the transcription 
initiation site of the nucleic acid to be transcribed, 
permits transcription of the nucleic acid under nucleic 
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acid polymerization-permitting conditions (e.g. the 
presence of a suitable polymerase, nucleotides and 
other reagents) . That is, the self-priming moiety 
permits the same result (i.e. transcription) as does a 
5 non-bound primer. In one embodiment, the self-priming 
moiety is a single stranded nucleic acid having a 
hairpin structure. Examples of such self-priming 
moieties are shown in the Figures . 



10 "Hybridize" shall mean the annealing of one 
single-stranded nucleic acid to another nucleic acid 
based on sequence complementarity. The propensity for 
hybridization between nucleic acids depends on the 
temperature and ionic strength of their milieu, the 

15 length of the nucleic acids and the degree of 
complementarity. The effect of these parameters on 
hybridization is well known in the art (see Sambrook J, 
Fritsch EF, Maniatis T . 198 9. Molecular cloning: a 
laboratory manual. Cold Spring Harbor Laboratory Press, 

2 0 New York. ) 



As used herein, "nucleotide analogue' 7 shall mean an 
analogue of A, G, C, T or U (that is, an analogue of a 
nucleotide comprising the base A, G, C, T or U) which 

2 5 is recognized by DNA or RNA polymerase (whichever is 
applicable) and incorporated into a strand of DNA or 
RNA (whichever is appropriate) . Examples of nucleotide 
analogues include, without limitation 7-deaza-adenine , 
7-deaza-guanine, the analogues of deoxynucleotides 

30 shown in Figure 6, analogues in which a label is 
attached through a cleavable linker to the 5-position 
of cytosine or thymine or to the 7-position of deaza- 
adenine or deaza-guanine, analogues in which a small 
chemical moiety such as -CH 2 CH=CH 2 is used to cap the - 
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OH group at the 3' -position of deoxyribose, and 
analogues of related dideoxynucleotides . Nucleotide 
analogues, including dideoxynucleotide analogues, and 
DNA polymerase-based DNA sequencing are also described 
5 in U.S. Patent No. 6,664,079. 

1,3 dipolar azide-alkyne cycloaddition chemistry is 
described in WO 2005/084367 and PCT/US03/39354 , the 
contents of each of which are hereby incorporated by 
10 reference. 



All embodiments of U.S. Patent No. 6, 664, 079 (the 
contents of which are hereby incorporated by reference) 
with regard to sequencing a nucleic acid are 
15 specifically envisioned here. 

With regard to the synthesis of the nucleotide 
analogues disclosed herein, other fluorophores or 
chromophores to be photocleavably attached to the base 
2 0 of the analogue are envisioned. In addition, 
combinatorial fluorescence energy tags as described in 
U.S. Patent No. 6,627,748 (the contents of which are 
hereby incorporated by reference) may be used in place 
of the fluorophores described herein. 



25 
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Embodiments of the Invention 

This invention provides a method for making 3'0-allyl- 
dGTP-PC-Bodipy-FL-510 comprising performing the steps 
set forth in Figure 7. This invention also provides a 
method for making 3' -0-allyl-dATP-PC-ROX comprising 
performing the steps set forth in Figure 8. This 
invention also provides a method for making 3'-0-allyl- 
dCTP-PC-Bodipy-650 comprising performing the steps set 



! 



WO 2007/053702 PCT7US2006/042698 

13 

■ forth in Figure 9. This invention also provides a 
method for making 3' -0-allyl-dUTP-PC-R6G comprising 
performing the steps set forth in Figure 10. 

5 This invention also provides a method for making method 
for determining the sequence of a DNA comprising 
performing the following steps for each residue of the 
DNA to be sequenced: 

(a) contacting the DNA with a DNA polymerase in 
10 the presence of (i) a primer and (ii) four 

fluorescent nucleotide analogues under 
conditions permitting the DNA polymerase to 
catalyze DNA synthesis, wherein (1) the 
nucleotide analogues consist of an analogue of 
15 dGTP, an analogue of dCTP, an analogue of dTTP 

or dUTP, and an analogue of dATP, (2) each 
nucleotide analogue comprises (i) a base 
selected from the group consisting of adenine, 
guanine, cytosine, thymine or uracil, and 
20 analogues thereof, (ii) a deoxyribose, (iii) a 

fluorophore photocleavably attached to the 
base, and (iv) an allyl moiety bound to the 3'- 
oxygen of the deoxyribose, so that a nucleotide 
analogue complementary to the residue being 
25 sequenced is bound to the DNA by the DNA 

polymerase, and (3) each of the four analogues 
has a predetermined fluorescence wavelength 
which is different than the fluorescence 
wavelengths of the other three analogues; 
30 ' (t>) removing unbound nucleotide analogues ; 

(c) determining the identity of the bound 
nucleotide analogues; and 

(d) following step (c) , except with respect to 
the final DNA residue to be sequenced, (i) 
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chemically cleaving from the bound nucleotide 
analogue the allyl moiety bound to the 3'- 
oxygen atom of the deoxyribose and (ii) 

photocleaving the fluorophore from the bound 
5 nucleotide analogue, wherein steps (d) (i) and 

(d) (ii) can be performed concurrently or in any 

order, and step (d) (i) is performed using a Pd 

catalyst at a pH of about 8.8, 

thereby determining the sequence of the DNA. 



10 



In one embodiment of the instant method, chemically 

* 

cleaving the allyl moiety bound to the 3 '-oxygen atom 
is performed using Na 2 PdCl 4 . 



15 In one embodiment of the instant method, the primer is 
a self-priming moiety. 

In one embodiment of the instant method, the DNA is 
bound to a solid substrate. In one embodiment of the 
instant method, the DNA is bound to the solid substrate 
via 1,3-dipolar azide-alkyne cycloaddition chemistry. 
In one embodiment of the instant method, about 1000 or 
fewer copies of the DNA are bound to the solid 
substrate . 



20 
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In one embodiment of the instant method, the four 
fluorescent nucleotide analogues are. 3' -O-allyl-dGTP- 
PC-Bodipy-FL-510, 3' -O-allyl-dATP-PC-ROX, 3' -O-allyl- 
dCTP-PC-Bodipy-650 and 3 ' -0-allyl-dUTP-PC-R6G . 

In one embodiment of the instant method, the DNA 
polymerase is a 9°N polymerase. 
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This invention also provides a method for removing an 
allyl moiety from the 3 '-oxygen of a nucleotide 
analogue's deoxyribose moiety comprising the step of 
contacting the nucleotide analogue with a Pd catalyst 
5 at a pH of about 8.8. In one embodiment of the instant 
method, the Pd catalyst is Na 2 PdCl 4 . 

In embodiments of this invention the sequencing methods 
described can be applied, mutatis mutandis, to 
sequencing an RNA molecule or an RNA/DNA hybrid 
molecule . 



10 



This invention will be better understood by reference 
to the Experimental Details which follow, but those 
15 skilled in the art will readily appreciate that the 
specific experiments detailed are only illustrative of 
the invention as described more fully in the claims 
which follow thereafter. 
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Experimental Details 

The design and synthesis of a complete set of four- 
color 3'-0-allyl modified photocleavable fluorescent 

5 nucleotides as reversible terminators for SBS is 
disclosed herein. 
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Example 1 



10 Synopsis 

DNA sequencing by synthesis (SBS) offers a new approach 
for potential high-throughput sequencing applications. 
In this method, the ability of an incoming nucleotide 
to act as a reversible terminator for a DNA polymerase 
reaction is an important requirement to unambiguously 
determine the identity of the incorporated nucleotide 
before the next nucleotide is added. A free 3' -OH group 
on the terminal nucleotide of the primer is necessary 
for the DNA polymerase to incorporate an incoming 
2 0 nucleotide. Therefore, if the 3' -OH group of an 
incoming nucleotide is capped by a chemical moiety, it 
will cause the polymerase reaction to terminate after 
the nucleotide is incorporated into the DNA strand. If 
the capping group is subsequently removed to generate a 
25 free 3' -OH, the polymerase reaction will reinitialize. 
Here, the design and synthesis of a 3' -modified 
photocleavable fluorescent nucleotide, 3' -O-allyl-dUTP- 
PC-Bodipy-FL-510, as a reversible terminator for SBS is 
disclosed. This nucleotide analogue contains an allyl 
moiety capping the 3' -OH group and a fluorophore 
Bodipy-FL-510 linked to the 5 position of the uracil 
through a photocleavable 2-nitrobenzyl linker. In 
addition, it is shown that this nucleotide is a good 
substrate for a DNA polymerase. After the nucleotide 
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was successfully incorporated into a growing DNA strand 
and the fluorophore photocleaved, the allyl group was 
removed using a Pd catalyzed reaction to reinitiate the 
polymerase reaction, thereby establishing the 
5 feasibility of using such nucleotide analogues as 
reversible terminators for SBS . 

Introduction 

10 The completion of the Human Genome Project (1, 2) has 
led to an increased demand for high- throughput and 
rapid DNA sequencing methods to identify genetic 
variants for applications in pharmacogenomics (3) , 
disease gene discovery (4, 5) and gene function studies 

15 (6) . Current state-of-the-art DNA sequencing 
technologies (7-11) to some extent address the accuracy 
and throughput requirements but suffer limitations with 
respect to cost and data quality. Thus, new DNA 
sequencing approach is required to broaden the 

2 0 applications of genomic information in medical research 
and health care. In this regard, DNA sequencing by 
synthesis (SBS) offers an alternative approach to 
possibly address the limitations of current DNA 
sequencing techniques. The design of a parallel chip 

2 5 based SBS system, which uses a self-priming ■ DNA 
template covalently linked to the glass surface of a 
chip and four modified nucleotides has previously been 
described (12-14). The nucleotides are modified such 
that they have a photocleavable fluorescent moiety 

30 attached to the base (5 position of pyrimidines, 7 
position of purines) and a chemically deavable group 
to cap the 3' -OH. When the correct nucleotide is 
incorporated in a DNA polymerase reaction, specific to 
the template sequence, the reaction is temporarily 
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terminated because of the lack of a free 3 '-OH group. 
After the fluorescent signal is detected and the 
nucleotide identified, the 3' -OH needs to be 
regenerated in order to continue incorporating the next 
5 nucleotide. In Example 3 hereinbelow, it is 
demonstrated that 4 photocleavable fluorescent 
nucleotides can be efficiently incorporated by DNA 
polymerase into a growing DNA strand base specifically 
in a polymerase extension reaction, and the 
fluorophores can be completely removed by photocleavage 
under near UV irradiation (A,~355 run) with high 
efficiency (15). Using this system in a four-color 
sequencing assay, accurate identification of multiple 
bases in a self-priming DNA template covalently 
15 attached to a glass surface can be achieved. 



10 



Another important requirement for this approach to 
sequence DNA unambiguously is a suitable chemical 
moiety to cap the 3' -OH of the nucleotide such that it 

2 0 terminates the polymerase reaction to allow the 
identification of the incorporated nucleotide. The 
capping group then needs to be efficiently removed to 
regenerate the 3' -OH thereby allowing the polymerase 
reaction to continue. Thus, the photocleavable 

2 5 fluorescent nucleotides used in SBS must be reversible 
terminators of the DNA polymerase reaction to allow the 
detection of the fluorescent signal such that the 
complementary DNA synthesis and sequence identification 
can be efficiently performed in tandem. The principal 

30 challenge posed by this requirement is the 
incorporation ability of the 3 '-modified nucleotide by 
DNA polymerase into the growing DNA strand. The 3'- 
position on the sugar ring of a nucleotide is very 
close to the amino acid residues in the active site of 
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the DNA polymerase. This is supported by the 3-D 
structure of the previously determined ternary 
complexes of rat DNA polymerase, a DNA template-primer , 
and dideoxycytidine triphosphate (16) . Thus, any bulky 
5 modification at this position provides steric hindrance 
to the DNA polymerase and prevents the nucleotide from 
being incorporated. A second challenge is the efficient 
removal of the capping group once the fluorescence 
signal is detected. Thus, it is important to use a 
functional group small enough to present no hindrance 
to DNA polymerase, stable enough to withstand DNA 
extension reaction conditions, and able to be removed 
easily and rapidly to regenerate a free 3' -OH under, 
specific conditions. 
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Results 



Numerous studies, have previously been undertaken to 
identify a 3' -modified nucleotide as a substrate for 
DNA polymerase. 3 ' -O-methyl-nucleotides have been shown 
to be good substrates for several polymerases (17) . 
However, the procedure to chemically cleave the methyl 
group is stringent and requires anhydrous conditions . 
Thus, it is not practical to use a methyl group to cap 
the 3 T -OH group for SBS . It has been reported that 
25 nucleotides with ether linkages at the 3' position can 
be incorporated by some DNA polymerases, while those 
with ester linkages are not generally accepted by most 
of the polymerases tested (18) . Significant efforts 
have been dedicated to evaluate a wide variety of 3'- 
30 modified nucleotides to be used as terminators for 
various DNA polymerases and reverse transcriptases but 
none of . the functional groups tested have had 
established methods to regenerate a free 3'-OH (19-22). 
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It is known that stable chemical functionalities such 
as allyl (-CH 2 -CH=CH 2 ) and methoxymethyl (-CH 2 -0-CH 3 ) 
groups can be used to cap an OH group, and can be 
cleaved chemically with high yield (23, 24) . Use of 
5 such groups as reversible caps for the 3' -OH of the 
nucleotide for SBS (12) is investigated here, and the 
establishment of the allyl group as a 3 '-OH capping 
moiety for the nucleotide analogues that can be used in 
SBS is revealed- The choice of this group was based on 
the fact that the allyl moiety, being relatively small, 
would not provide significant hindrance for the 
polymerase reaction, and therefore allow the incoming 
3'-0-allyl modified nucleotide analogue to be accepted 
by DNA polymerase. Furthermore, it was proposed to 
15 remove this group using catalytic deallylation . Here, 
the synthesis of a photocleavable fluorescent 
nucleotide analogue, 3' -O-allyl-dUTP-PC-Bodipy-FL-510 , 
that can be efficiently incorporated by DNA polymerase 
into a growing DNA strand is shown. The allyl group can 
be rapidly and completely removed by a Pd catalyzed 
reaction to regenerate a 3' -OH group and the 
deallylated DNA can then allow reinitiation of the 
polymerase reaction to incorporate the subsequent 
nucleotide analogue . 
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All chemicals were purchased from Sigma-Aldrich unless 
otherwise indicated. Oligonucleotides used as primers 
or templates were synthesized * on an EXPEDITE Nucleic 
30' Acid Synthesizer (Applied Biosystems) . X H NMR spectra 
were recorded on a Bruker 400 spectrometer, while 13 C 
and P NMR spectra were recorded on a Bruker 300 
spectrometer. High-resolution MS (HRMS) data were 
obtained by using a JEOL JMS HX 110A mass spectrometer. 
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Mass measurement of DNA was made on a Voyager DE 
matrix-assisted laser desorption ionization time-of- 
flight (MALDI-TOF) mass spectrometer (Applied 
Biosystems) . Photolysis was performed using a Spectra 
5 Physics GCR-150-30 Nd-YAG laser that generates light 
pulses at 355 nm (ca. 50 mJ/pulse, pulse length ca. 7 
ns) at a frequency of 30 Hz with a light intensity at 
ca. 1.5 W/cm 2 . Thermo Sequenase DNA Polymerase, HIV-1 
and RAV2 Reverse Transcriptases were obtained from 
Amersham Biosciences. Therminator, Vent (exo-) , Deep 
Vent (exo-) , Bst and Klenow (exo-) fragment DNA 
Polymerases were obtained from New England Biolabs. 9°n 
Polymerase (exo-) A485L/Y409V was generously provided 
by New England Biolabs. Sequenase V2 DNA Polymerase, M- 
15 MulV and AMV Reverse Transcriptases were obtained from 
DSB Corporation (Cleveland, OH) . Tfl and Tth DNA 
Polymerases were obtained from Promega Corporation 
(Madison, WI) . Pfu (exo-) DNA Polymerase was obtained 
from Stratagene, Inc. (La Jolla, CA) . Phosphoramidites 
and columns for nucleic acid synthesis were obtained 
from Glen Research (Sterling, VA) . 

Synthesis of a 3'-0-allyl modified 19-mer 
oligonucleotide. 

25 

3'-0-allyl-thymidine phosphoramidite 3, prepared 
according to Fig. l was used to synthesize a 19-mer 
oligonucleotide 5' -AGA-GGA-TCC-AAC-CGA-GAC-T (allyl) -3' 
4 (SEQ ID NO:l)(MW = 5871). The synthesis was carried 
out in the 5' to 3' direction using 3 along with dA-5'- 
CE, dC-5'-CE, dG-5'-CE and dT-5'-CE phosphoramidites 
and a dA-5'-CPG column. The oligonucleotide was 
purified by HPLC using an Xterra MS C18 (4.6 x 50 mm) 
column (Waters) . The elution was performed over 90 min 
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at a flow rate of 0.5 ml/min and a fixed temperature of 
50 °C using a linear gradient (12 - 34.5%) of methanol 
in a buffer containing 8.6 mM triethylamine and 100 mM 
hexaf luoroisopropyl alcohol (pH = 8.1)- ' The product 
5 was characterized using MALDI-TOF MS. 

Deallylation reaction performed using the 3'-0-allyl 
modified 19— mer oligonucleotide. 

10 For the deallylation reaction, 55 equivalents of 
Na2PdCl4 and 440 equivalents of a trisodium 
triphenylphosphinetrisulfonate (TPPTS) ligand were used 
in water at 70 °C. Na 2 PdCl 4 in degassed water (0.7 \xl, 
2.2 nmol) was added to a solution of TPPTS in degassed 

15 water (1 |il, 17.6 nmol) and mixed well. After 5 min, a 
solution of 3' -O-allyl modified oligonucleotide 4 (1 
jjil, 4 0 pmol) was added. The reaction mixture was then 

■ 

placed in a heating block at 7 0 °C and incubated for 30 
seconds. The resulting deallylated product was desalted 

2 0 by Zip Tip (Millipore Corporation) and analyzed using 

MALDI-TOF MS. 

Primer extension reaction performed with the 
deallylated DNA product. 

25 The 10 p.1 extension reaction mixture consisted of 45 
pmol of the deallylated DNA product as a primer, 100 
pmol of a single-stranded synthetic 100-mer DNA 
template (sequence shown in reference 15) corresponding 
to a portion of exon 7 of the p53 gene, 100 pmol .of 

3 0 Biotin-11-2 ' , 3' -dideoxyguanosine-5' - triphosphate 

(Biotin-ll-ddGTP) terminator (Perkin Elmer) , IX Thermo 
Sequenase reaction buffer and 4 U of Thermo Sequenase 
DNA Polymerase. The extension reaction consisted of 15 
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cycles at 94 °C for 20 sec, 48 °C for 30 sec and 60 °C 
for 60 sec. The product was purified using solid phase 
capture on s treptavidin-coated magnetic beads (25), 
desalted using Zip Tip and analyzed using MALDI-TOF MS. 
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Synthesis of 3' -O-allyl-dUTP-PC-Bodipy-FL-510 . 

3'-0-allyl-dUTP-PC-Bodipy-FL-510 10 was synthesized as 
shown in Fig. 2. Detailed synthesis procedures and 
characterization data for all intermediate compounds 
10 (6-9) are described in the supporting information. 

* 

PC-Bodipy-FL-510 NHS ester (13) (7.2 rag, 12 mmol) in 
300 ial of acetonitrile was added to a solution of 3 • -o- 
allyl-5- (3-aminoprop-l-ynyl) -2 ' -deoxyuridine-5 ' - 
15 triphosphate 9 (2 mg, 4 mmol) in 300 ul of Na 2 C0 3 -NaHC0 3 
buffer (0.1 M, pH 8.7). The reaction mixture was 
stirred at room temperature for 3 h. A preparative 
silica-gel TLC plate was used to separate the unreacted 
PC-Bodipy-FL-510 NHS ester from the fractions 
20 containing 10 (CHCI3/CH3OH, 85/15) . The product was 
concentrated further under vacuum and purified with 
reverse-phase HPLC on a 150 ' 4 . 6-mm C18 column to 
obtain the pure product 10 (retention time = 35 min) . 
Mobile phase:. A, 8 . 6 mM triethylamine/100 mM 
25 hexafluoroisopropyl alcohol in water (pH .8.1); B, 
methanol. Elution was performed with 100% A isocratic 
over 10 min followed by a linear gradient of 0-50% B 
for 20 min and then 50% B isocratic over another 
20 min. 3 ' -O-allyl-dUTP-PC-Bodipy-FL-510 10 was 

characterized by the following single base extension 
reaction and MALDI-TOF MS. 
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Primer extension using 3' -O-allyl-dUTP-PC-Bodipy-FL-510 
and photocleavage of the extension product. 
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An 18-mer oligonucleotide 5' -AGA-GGA-TCC-AAC-CGA-GAC-3' 
(SEQ ID NO: 2) (MW = 5907) was synthesized using dA-CE, 
dC-CE, dG-CE and Biotin-dT phosphoramidites . A primer 
extensi6n reaction was performed using a 15 (1 reaction 
mixture consisting of 50 pmol of primer, 100 pmol of 
single-stranded synthetic 100-mer DNA template 
corresponding to a portion of exon 7 of the p53 gene 
(15), 200 pmol of 3'-O-allyl-dUTP-PC-Bodipy-FL-510, IX 
Thermopol reaction buffer (New England Biolabs) and 15 
U of 9(N Polymerase (exo-) A485L/Y409V. The extension 
reaction consisted of 15 cycles of 94 (C for 20 sec, 48 
(C for 30 sec and 60 (C for 60 sec. A small portion of 
the DNA extension product 11 was desalted using Zip Tip 
and analyzed using MALDI-TOF MS. The rest of the 
product was f reeze-dried, resuspended in 200 (1 of 
deionized water and irradiated for 10 sec in a quartz 
cell with path lengths of 1.0 cm employing an Nd-YAG 
laser ((-355 nm) to cleave the fluorophore from the 
DNA, yielding product 12 . 

Deallylation of the DNA extension product generated by 
the incorporation of 3' -O-allyl-dUTP-PC-Bodipy-FL-510 . 

The above photocleaved 3'-0-allyl modified DNA product 
12 (180 pmol produced in multiple reactions) was dried 
and resuspended in 1 (1 of deionized H20. Na2PdC14 in 
degassed H20 (4.1 (1, 72 nmol) was added to a solution 
of TPPTS in degassed H20 (2.7 (1, 9 nmol) and mixed 
well. After 5 min, the above DNA product (1 (1, igo 
pmol) was added. The reaction mixture was then placed 
in a heating block, incubated at 70 °C for 90 sec to 
yield deallylated product 13, and then cooled to room 
temperature for analysis by MALDI-TOF MS. 
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Polymerase extension and photocleavage using the 
deallylated DNA product as a primer. 

The above deallylated DNA product 13 was used as a 
primer in a single base extension reaction. The 10 (1 
reaction mixture consisted of 50 pmol of the above 
deallylated product 13, 125 pmol of dGTP-PC-Bodipy-FL- 
510 (14), 4 U of Thermo Sequenase DNA Polymerase and IX 
reaction buffer. The extension reaction consisted of 15 
cycles of 94 (C for 20 sec, 48 (C for 30 sec and 60 (C 
for 60 sec. The DNA extension product 14 was desalted 
using the Zip Tip protocol and a small portion was 
analyzed using MALDI-TOF MS. The remaining product was 
then irradiated with near UV light for 10 sec to cleave 
the fluorophore from the extended DNA product. The 
resulting photocleavage product 15 was desalted and 
analyzed using MALDI-TOF MS. 



2 0 Discussion 

It is shown here that an allyl moiety can be 
successfully used as a blocking group for the 3' -OH of 
a photocleavable fluorescent nucleotide analogue in SBS 
to prevent the DNA polymerase reaction from continuing 

25 after the incorporation of the 3'-0-allyl modified 
nucleotide analogue. Furthermore, it is demonstrated 
that the allyl group can be efficiently removed to 
generate a free 3' -OH group and allow the DNA 
polymerase reaction to continue to the subsequent 
3 0 cycle. 

Conventional methods for cleavage of the allyl group 
combine a transition metal-catalyzed isomerization of 
the double bond to the enol ether and subsequent 
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hydrolysis of the latter to produce the corresponding 
alcohol (26, 27) . For application in SBS, it is 
important to ensure that complete chemical cleavage of 
the 3'-0-allyl group can be rapidly and specifically 
5 carried out while leaving the DNA intact. Trisodium 
triphenylphosphinetrisulfonate (TPPTS) has been widely 
used as a ligand for Pd mediated deallylation under 
aqueous conditions (28-30), while an active Pd catalyst 
can be generated from Na2PdC14 and an appropriate 
10 ligand (31, 32) . Thus, a water-soluble Pd catalyst 
system generated from Na 2 PdCl 4 and TPPTS was 
investigated for deallylation of the 3'-0-allyl 
modified DNA product. 

15 To evaluate the cleavage conditions of the allyl group 
capping the 3' -OH of DNA, first a 19-mer 
oligonucleotide [ 5' -AGAGGATCCAACCGAGAC-T (allyl) -3' ] 

(SEQ ID NO: 3) was synthesized using 3'-0-allyl- 
thymidine phosphoramidite (Fig. 3) . The identity of the 
2 0 purified oligonucleotide was established using MALDI- 
TOF mass spectrometry. Then the above Na 2 PdCl 4 /TPPTS 
catalyst system was tested for the deallylation of the 
oligonucleotide. In Fig. 3A, the mass peak at m/z 5871 
corresponds to the mass of the purified oligonucleotide 
2 5 bearing the allyl group. Fig. 3B shows a single mass 
peak at m/z 5831 indicating that near complete 
deallylation was achieved with a DNA/Na 2 PdCl 4 /TPPTS 
ratio of 1/55/440 in a reaction time of 30 seconds. The 
next step was to prove that the above deallylated DNA 
product could be used as a primer in a polymerase 
extension reaction. A single base extension reaction 
was performed using the deallylated DNA product as a 
primer, a synthetic template and a Biotin-ll-ddGTP 
nucleotide terminator which was complementary to the 
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base immediately adjacent to the priming site on the 
template. The DNA extension product was isolated using 
solid phase capture purification and analyzed using 
MALDI-TOF MS (25) . The mass spectrum in Fig. 3C shows a 
5 clear peak at m/z 6535 corresponding to the extension 
product indicating that the deallylated product can be 
successfully used as a primer in a polymerase reaction. 

The above experiments established that Na 2 PdCl 4 and 
10 TPPTS could be used to efficiently carry out 
deallylation on DNA in an aqueous environment. Our next 
step was to investigate if a 3' -O-allyl-modif ied 
nucleotide could be incorporated in a DNA polymerase 
reaction. For this purpose, a nucleotide analogue 3' -CD- 
IS allyl-thymidine triphosphate (3' -O-allyl-dTTP) was 
synthesized which was tested with 15 different 
polymerases for incorporation. The tested enzymes 
included Therminator , Thermo Sequenase, Vent (exo-) , 
Deep Vent (exo-), Tth, Tfl, Bst, Pfu (exo-), Klenow 
2 0 (exo-) fragment and Sequenase DNA Polymerases, AMV, 
RAV2, M-MulV, HIV reverse transcriptases and a 9°N 
Polymerase (exo-) bearing the mutations A48 5L and 
Y4 0 9V. Our preliminary results showed that 9°N DNA 
polymerase (exo-) . A485L/Y409V could efficiently 
25 incorporate 3 ' -O-allyl-dTTP in an extension reaction, 
consistent with results reported recently (31) . 

After confirming the incorporation ability of 3'-0- 
allyl-dTTP into a growing DNA strand by DNA polymerase, 
a new 3' -modified photocleavable fluorescent nucleotide 
analogue was synthesized, 3' -O-allyl-dUTP-PC-Bodipy-FL- 
510, according to Fig. 2, and it was established that 
the analogue can also can be efficiently incorporated 
by the above polymerase. The aim was to evaluate that 
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the presence of the bulky photocleavable fluorescent 

moiety on the base and the allyl group on the 3' end of 

the nucleotide analogue would not affect the polymerase 

extension reaction. Furthermore, demonstration of an 

5 entire cycle of primer extension was desirable , 

photocleavage of the fluorophore, deallylation followed 

by extension with another photocleavable fluorescent 

nucleotide complementary to the next base on the 

template and photocleavage once again. This experiment 

10 will thus test the feasibility of using 3'-0-allyl- 

dUTP-PC~Bodipy-FL-510 as a reversible terminator for 
SBS . 
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The entire cycle of a polymerase reaction using 3'-0- 
15 allyl-dUTP-PC-Bodipy-FL-510 as a reversible terminator 
is depicted in Fig. 5. The extension product 11 
obtained using 3' -O-allyl-dUTP-PC-Bodipy-FL-510 and 9°N 
DNA Polymerase (exo-) A485L/Y409V was purified using 
HPLC and analyzed using MALDI-TOF MS. The base in the 
template immediately adjacent to the priming site was 
*A' . Thus, if 3'-O-allyl-dUTP-PC-Bodipy-FL-510 was 
accepted by the polymerase as a terminator, the primer 
would extend by one base and then the reaction would 
terminate. Our results indicate that this was indeed 
the case . After confirming that the extension reaction 
was successful, it was irradiated with near UV light at 
355 nm for 10 seconds to cleave the fluorophore from 
the DNA, generating product 12. In an SBS system, this 
step would ensure that there would be no carryover of 
the • fluorescence signal into the next incorporation 
cycle so as to prevent the generation of ambiguous data 
at each step, as shown in the accompanying paper (15) . 
The photocleavage product 12 was then incubated with a 
Na 2 PdCl 4 /TPPTS catalyst system at 70 °C for 90 seconds 
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to perform deallylation. The deallylated DNA product 
13 was purified by reverse phase HPLC and then 
used as a primer in a second DNA extension reaction to 
prove that the regenerated 3' -OH was capable of 
5 allowing the polymerase reaction to continue. For the 
extension reaction, a photocleavable fluorescent 
nucleotide dGTP-PC-Bodipy-FL-510 was used and Thermo 
Sequenase DNA polymerase. The extension product 14 was 
irradiated as above, for 10 seconds to generate 
photocleavage product 15 and hence complete an entire 
reversible termination cycle. 
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After each step in the above cycle, a portion of the 
product was purified and analyzed using MALDI-TOF MS to 
15 confirm its identity and the successful completion of 
that step. Each product was desalted using the Zip Tip 
desalting protocol to ensure the generation of sharp 
and well-resolved data free from salt peaks. The MALDI- 
TOF MS data for. each step are shown in Fig. 5. Fig. 5A 
shows the primer extension product 11 at m/z 67 87 
generated using 3' -O-allyl-dUTP-PC-Bodipy-FL-510 . The 
peak at m/z 62 92 corresponds to the photocleavage 
product that was generated by the partial photocleavage 
of the extension product due to the nitrogen laser 
25 (A,~337 nm) used for ionization of the analyte in MALDI- 
TOF MS. Fig. 5B shows the photocleavage result after 
the 10-second irradiation of the extension product at 
355 nm. It can be seen from the data that the peak at 
m/z 67 87, corresponding to the extension product has 
30 completely vanished and only a single peak 
corresponding to 12 remains at m/z 6292, which proves 
that photocleavage was efficiently achieved. Fig. 5C 
shows a similar single peak at m/z 6252, which 
corresponds to the deallylated photocleavage product 
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13. The absence of a significant peak at m/z 6292 
proves that deallylation was completed with high 
efficiency. Fig. 5D shows the MALDI-TOF MS data for the 
extension product obtained using the above deallylated 
5 DNA product 13 as a primer and nucleotide analogue 
dGTP-PC-Bodipy-FL-510. A dominant peak is seen at m/z 
7133 corresponding to the extension product 14. 
Finally, Fig. 5E shows a clear peak at m/z 6637 
corresponding to the photocleavage product 15 and no 
significant peak at m/z 7133 indicating that complete 
photocleavage had occurred. 
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The results of the above experiments provide sufficient 
proof of the feasibility of using the allyl group as a 
reversible capping moiety for the 3' -OH of the 
photocleavable nucleotide analogues for SBS . It is 
shown that a 3'-0-allyl modified nucleotide bearing a 
photocleavable fluorophore is an excellent substrate 
for 9°N DNA polymerase A485L/Y409V and can be 
incorporated with high efficiency in a polymerase 
extension reaction. It is also demonstrated that 
complete photocleavage is achieved in -10 seconds on 
these DNA products. Furthermore, it is shown that 
deallylation can be swiftly achieved to near completion 
25 under mild reaction conditions in an aqueous 
environment using a palladium catalyst. Finally, it is 
have established that the deallylated DNA product can 
be used as a primer to continue the polymerase reaction 
and that extension and photocleavage can be performed 
with high efficiency. These findings confirm that an 
allyl moiety protecting the 3' -OH group indeed bestows 
the capability of reversible terminating abilities to 
photocleavable nucleotide analogues, which can be used 
for SBS. Further efforts are being focused on 
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generating four nucleotide analogues (A, C, G and T) , 
each with a distinct photocleavable fluorophore and 
with a 3'-0-allyl capping group. These nucleotides will 
facilitate the development of SBS for high-throughput 
5 DNA sequencing and genotyping applications. 



Example 2 

1 0 Synopsis 

DNA sequencing by synthesis (SBS) using reversible 
fluorescent nucleotide terminators is a potentially 
efficient approach to address the limitations of 
current DNA sequencing techniques. Here, the design and 
15 synthesis of a complete set of four-color 3'-0-allyl 
modified photocleavable fluorescent nucleotides as 
reversible terminators for SBS is described. The 
nucleotides are efficiently incorporated by DNA 
polymerase into a growing DNA strand to terminate the 
polymerase reaction. After that the fluorophore is 
photocleaved quantitatively by irradiation at 355 nm, 
and the allyl group is rapidly and efficiently removed 
by using a Pd-catalyzed reaction under DNA compatible 
conditions to regenerate a free 3' -OH group to 
reinitiate the polymerase reaction. A homopolymeric 
region of a DNA template was successfully sequenced 
using these 3'-0-allyl modified nucleotide analogues, 
facilitating the development of SBS as a viable 
approach for high-throughput DNA sequencing 
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Introduction 



The design and synthesis of a complete set of four- 
color 3'-0-allyl modified photocleavable fluorescent 
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nucleotides, 3' -O-allyl-dGTP-PC-Bodipy-FL-510, 3' -o- 
allyl-dCTP-PC-Bodipy-650, 3' -0-allyl-dUTP-PC-R6G and 
3'-0-allyl-dATP-PC-ROX, is disclosed here, as shown in 
Fig. 6. Their applications as reversible terminators 
5 for SBS are also described here, demonstrating the 
base-specific incorporation of these nucleotide 
analogues by DNA polymerase, the highly efficient 
photocleavage of the fluorescent dye, and the rapid and 
complete removal of 3' -O-allyl . group in a Pd-catalyzed 
10 reaction under DNA compatible conditions. Previously, 
the Pd-catalyzed deallylation to regenerate a free 3'- 
OH of the DNA extension product was carried out in pure 
water (34) which can destabilize the primer-template 
duplex. A new condition for rapid quantitative 
15 deallylation in a buffer solution at pH 8.8 has been 
identified here, which is commonly used in a polymerase 
reaction. The successful synthesis of these 3' -O-allyl 
modified photocleavable fluorescent nucleotides as 
reversible terminators to sequence through a 
2 0 homopolymer sequence, and the identification of the new 
deallylation condition will facilitate the development 
of SBS as a viable approach for de novo DNA sequencing. 

As an example, 3 ' -O-allyl-dGTP-PC-Bodipy-FL-510 (10) is 
25 used here to illustrate the detailed synthesis strategy 
and procedures. To the applicants' knowledge, using 3'- 
modified dGTP as a reversible terminator for SBS has 
not been reported, partly due to the difficulty of 
modifying 3' -OH of guanosine by a suitable capping 
30 group without protecting the guanine base. Structure 10 
was prepared following a synthesis route as shown in 
Fig. 7. 
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2-amino-4-methoxy-7- O-D-2-deoxyribofuranosyl) 
pyrrolo[2,3-d]-pyrimidine 1 was chosen as the starting 
material for the synthesis of 3' -O-allyl-dGTP 9 (Fig. 
7) (see (35)). Structure 1 was first protected by 
5 isobutyryl chloride to form 2 quantitively (4) . 
Structure 2 was iodinized with NIS to afford a single 
iodo product 3 in 84% yield, as the protected 2-amino 
group in the heterocyclic ring promotes the formation 
of 7-substituted product (5) . Structure 3 was 
10 deprotected to afford 4 in 91% yield by sodium 
methoxide. The 5' -OH of 4 was protected by tert- 
butyldimethylsilyl chloride to yield 5 in 88% yield 
(6). The 3' -OH of 5 was subsequently allylated in 
CH 2 C1 2 and 4 0% aqueous NaOH solution using 
tetrabutylammonium bromide as phase-transfer catalyst 
to give a 92% yield of 6 without 2-N-allylated product. 
Cross-coupling reaction of 6 with the terminal alkyne 
catalyzed by Pd(0)/Cu(I) formed 7 in 94% yield (7). 
Next, a one-pot procedure of demethylation (8) and 
desilylation of 7 gave a moderate 34% yield of 8. 
Finally structure 8 was transformed into the 
corresponding triphosphate 9 following established 
procedures (9) . Coupling 9 with PC-Bodipy-FL-510 NHS 
ester (2) yielded the target compound, 3' -O-allyl-dGTP- 
25 PC-Bodipy-FL-510, structure 10. 

* 

3'-0-allyl-dATP-PC-ROX 19 was also prepared, as were 
3'-O-allyl-dCTP-PC-Bodipy-650 26 and 3' -O-allyl-dUTP- 
PC-R6G 33, as shown in Figs. 8, 9, and 10, 
30 respectively. 

For 3'-0-allyl modified PC fluorescent nucleotides to 
act as reversible terminators for SBS, it is important 
to establish' that they can be used to determine a 
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repeated DNA sequence in a polymerase reaction. To this 
end, polymerase DNA extension reactions were performed 
using these nucleotides as substrates in solution. This 
allows the isolation of the DNA product at each step of 
SBS for detailed molecular structure characterization 
by using MALDI-TOF mass spectrometry (MS) . 

3'-0-allyl-dGTP-PC-Bodipy-FL-510 (structure 10) was 
used as a substrate in a DNA extension reaction as 
shown in Fig. 11. a synthetic 100-mer DNA corresponding 
to a portion of exon 7 of the human p53 gene was used 
as a template to perform the extension. The sequence in 
the template immediately adjacent to the annealing site 
of the primer had a repeating sequence of 3'-CC-5' 
First, a polymerase extension reaction using 10 as a 
terminator along with a primer and the above template 
was performed. After the reaction, a small portion of 
the DNA extension product was characterized by MALDI- 
TOF MS. The rest of the product was irradiated at 355 
nm for 10 sec to cleave the fluorophore from the DNA 
and then analyzed by MALDI-TOF MS. After photocleavage , 
the DNA product was added to a deallylation cocktail 
[IX Thermopol reaction buf f er/Na 2 PdCl 4 /-P (PhS0 3 Na) 3 ] to 
remove the 3'-Allyl group in 30 sec to yield 
25 quantitatively deallylated DNA product. The deallylated 
DNA product with a free 3' -OH group regenerated was 
then used as a primer to incorporate 10 in a subsequent 
second polymerase extension reaction. 
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Fig. 12 (right panel) shows sequential mass spectrum at 
each step of DNA sequencing by synthesis using 10 as a 
reversible terminator. As can be seen from Fig. 12, 
panel (A) , the MALDI-TOF MS spectrum consists of a 
distinct peak at m/z 7,052 corresponding to the single 
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base DNA extension product 34 with 100% incorporation 
efficiency, confirming that the reversible terminator 
10 can be incorporated base-specifically by DNA 
polymerase into a growing DNA strand. The small peak at 
5 m/z 6,556 corresponding to the photocleavage product is 
due to the partial cleavage caused by the nitrogen 
laser pulse (337 nm) used in MALDI ionization. Fig. 12, 
panel (B) shows the photocleavage result after 10 sec 
irradiation of the DNA extension product at 355 nm. The 
10 peak at m/z 7,052 has completely disappeared, whereas 

■ 

the peak corresponding to the photocleavage product 35 
appears as the sole dominant peak at m/z 6,556. Fig. 
12, panel (C) shows a single peak at m/z 6,516, which 
corresponds to a deallylated photocleavage product 36. 

15 The absence of a peak at m/z 6,556 proves that the 
deallylation reaction was completed with high 
efficiency. The next extension reaction was carried out 
by using this deallylated photocleavage product 36 as a 
primer along with 3' -O-allyl-dGTP-PC-Bodipy-FL-510 (10) 

20 to yield an extension product 37 (Fig. 12, panel D) . 
DNA products (38 and 39) from photocleavage (Fig. 12, 
panel E) and deallylation (Fig. 12, panel F) 
respectively were obtained in similar manner as 
described previously, thereby completing two entire 

2 5 polymerase extension cycles to sequence a homopolymeric 
region of a template using 10 as a reversible 
terminator . 

3' -O-allyl-dATP-PC-ROX 19, mixed together with 3'-0- 
30 allyl-dGTP-PC-Bodipy-FL-510 10/3' -O-allyl-dCTP-PC- 

Bodipy-650 26/3' -0-allyl-dUTP-PC-R6G 33, was used as a 
reversible terminator in a primer extension reaction as 
shown in Fig. 13 (left panel) . After the incorporation, 
photocleavage and deallylation reactions were performed 
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on the DNA extension product, following a similar 
procedure as for 10. Fig. 13 (right panel, A) shows, the 
MOLDI-TOF MS results for the characterization of the 
product from each step. In the extension reaction, all 
four 3'-0-allyl modified photocleavable fluorescent 
nucleotides were added simultaneously instead of using 
only the correct one. The MS showed that only 3'-0- 
allyl-dATP-PC-ROX 19, the one complementary with the 
template sequence, was successfully incorporated in 
this extension reaction, as demonstrated by the single 
major peak at m/z 7, 228 and a partial photocleavage 
peak at m/z 6,495. There is no other DNA extension 
product observed, indicating a faithful incorporation 
of the 3'-0-allyl modified nucleotide. The MS results 
also demonstrated that the photocleavage and 
deallylation steps were successfully conducted as shown 
in Fig. 13 (right panel, B and C) with peaks at m/z 
6,495 and 6,455. 



20 3' -O-allyl-dCTP-PC-Bodipy-650 26, mixed together with 
3' -O-allyl-dGTP-PC-Bodipy-FL-510 10/3' -O-allyl-dATP-PC- 
ROX 19/3' -0-allyl-dUTP-PC-R6G 33, was used in a primer 
extension reaction and then photocleavage and 
deallyation reactions were performed on the DNA 

25 extension product, as shown in Fig. 14 (left panel) . 
Fig. 14 (right panel, A) shows the successful 
incorporation of 26, among the four nucleotide 
analogues, by the DNA polymerase to generate a single 
DNA extension product 43 at m/z 8,532. Subsequently, 

30 photocleavage was conducted to generate a photocleavage 
product 44 at m/z 7, 7 62 , and deallyation product 45 
was observed at m/z 1,122, as shown in Fig. 14 (right 
panel, B and C) , respectively. 
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Similarly, 3' -0-allyl-dUTP-PC-R6G 33, mixed together 
with 3' -O-allyl-dGTP-PC-Bodipy-FL-510 10/3' -O-allyl- 
dATP-PC-ROX 19/3'-0-allyl-dCTP-PC-Bodipy-650 26, also 
showed successful incorporation by a DNA polymerase in 
5 a primer extension reaction, as indicated by the single 
extension product (46) peak at m/z 6,210 in MALDI-TOF 
MS spectrum in Fig. 15 (right panel, A) . The 
fluorescent dye was then photocleaved to generate a 
photocleavage product 47 at m/z 5,552, and 3'-0-allyl 
10 was removed in a Pd-catalyzed reaction to generate a 
deallylated product 48 at m/z 5,512, as shown in Fig. 
15 (right panel, B and C) , respectively. 



15 Material and Methods 

General Information 1 H NMR spectra were recorded on 
Brucker DPX-400 (400 MHz) and Brucker DPX-300 
spectrometers and are reported in ppm from CD 3 OD or 

2 0 DMSO-d6 internal standard (3.31 or 2.5 0 ppm 
respectively). Data are reported as follows: (s - 
singlet, d = doublet, t = triplet, q = quartet, m = 
multiplet, dd = doublet of doublets, ddd = doublet of 
doublets of doublets; coupling constant (s) in Hz; 

2 5 integration; assignment) . Proton decoupled 13 C NMR 
spectra were recorded on a Brucker DPX-400 (100 MHz) 
spectrometer and are reported in ppm from CD 3 OD, DMSO- 
d6, or CDC1 3 internal standard (49.0, 39.5, or 77.0 ppm 
respectively) . Proton decoupled 31 P NMR spectra were 

30 recorded on a Brucker DPX-300 (121.4 MHz) spectrometer 
without calibration. High Resolution Mass Spectra 
(HRMS) were obtained on a JEOL JMS HX 11 OA mass 
spectrometer. Mass measurement' of DNA was made on a 
Voyager DE MALDI-TOF mass spectrometer (Applied 
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Biosystems ) . Photolysis was performed by using a 
Spectra Physics GCR-150-30 Nd-yttrium/aluminum garnet 
laser that generates light pulses at 355 nm. Compounds 
1 and 11 were purchased from Berry & Associates 
5 (Dexter, MI) . Bodipy-FL-510 NHS ester, ROX NHS ester, 
Bodipy-650 NHS ester and R6G NHS ester were purchased 
from Invitrogen (Carlsbad, CA) . All other chemicals 
were purchased from Sigma-Aldrich . 9°N polymerase (exo- 
). A485L/Y409V was generously provided by New England 
10 Biolabs. 



I. Synthesis of 3'-0-allyl modified photocleavable 
fluorescent nucleotides . 

15 

1) Synthesis of 3' -O-allyl-dGTP-PC-Bodipy-FL-510 as 
shown in Fig. 7. 

2- (2-Methylpropanoyl) amino-7- [3' ,5' -hls-O- (2- 
2 0 methylpropanoyl) -p-D~2' -deoxyribof uranosyl ] -4- 

methoxypyrrolo [2 , 3-d]pyrimidine (2): To a stirred 
suspension of 1 (1.00 g; 3.57 mmol) in anhydrous 
pyridine (35 mL) was added slowly isobutyryl chloride 
(3.40 mL; 32.2 mmol) at 0 °C . The reaction mixture was 
25 stirred at 0 °C for 1 h. Methanol (2 mL) was then added 
and the reaction mixture was stirred for another 10 
min. Then most solvent was removed under vacuum. Ethyl 
acetate (200 mL) and saturated aqueous NaHC0 3 (50 mL) 
were added to the residue. The organic layer was 
30 separated and washed by saturated aqueous NaHCC>3 and 
NaCl respectively, and dried over anhydrous Na 2 S0 4 . 
After evaporation of the solvent, the residue was 
purified by flash column chromatography over silica gel 
using ethyl acetate-hexane (1:3-2) as the eluent to 
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afford 2 as white foam (1.75 g; 99% yield): X H NMR (400 

MHz , CD 3 OD) 5 7.28 <d, J = 3.7 Hz, 1H, 6-H) , 6.66 (dd, 
J= 5.9, 8.6 Hz, 1H, l'-H), 6.51 (d, J = 3.7 Hz, 1H, 5- 
H) , 5.41 (m, 1H, 3'-H), 4.33-4.36 (m, 2H, 5'-H), 4.22 
5 (m, 1H, 4'-H), 4.08 (s, 3H, 4-OCH 3 ) / 2.83-2.96 (m, 2H, 
one of CH(CH 3 ) 2 and one of 2'-H), 2.54-2.70 (m, 2H, two 
of CH(CH 3 ) 2 ), 2. 48-2 .54 (ddd, J = 2.0, 5.9, 14.2 Hz, 
one of 2'-H), 1.15-1.23 (m, 18H, CH(CH 3 ) 2 )/ 13 C NMR (100 
MHz, CD3OD) 5 178.2, 177.7, 177.4, 164.2, 153.4, 152.5, 
10 123.4, 103.5, 100.7, 85.2, 83.0, 75.9, 65.0, 54.4, 
37.9, 36.6, 35.0, 34.9, 19.9 (two CH 3 ) , 19.3-19.4 (four 
peaks for four CH 3 ) ; HRMS (FAB+) calcd for C 2 4H 35 0 7 N4 
(M+H + ) : 491.2506, found: 491.2503. 

15 2- (2-Methylpropanoyl) ami no- 7- [3' ,5' -Jbxs-O- (2- 

methylpropanoyl) - p-D-2 ' -deoxyribo-f uranosyl] -5-iodo-4- 
methoxypyrrolo [2 , 3-d] pyrimidine (3): To a vigorously 
stirred solution of 2 (1.75 g; 3.57 mmol) in anhydrous 
DMF (27 mL) was added 95% N-iodosuccimide (NIS) (8 66 

2 0 mg; 3.66 mmol) . The reaction mixture was stirred at 
room temperature for 22 h, and then most solvent was 
removed under vacuum. Diethyl ether (200 mL) and 
saturated aqueous NaHC0 3 (50 mL) were added. The 
organic layer was separated and washed by saturated 

25 aqueous NaCl, and dried over anhydrous Na 2 S0 4 . After 
evaporation of the solvent, the residue was purified by 
flash column chromatography over silica gel using ethyl 
acetate-hexane (1:3) as the eluent to afford 3 as white 

solid (1.98 g; 90% yield): X H NMR (400 MHz, CD 3 OD) 5 
30 7.43 (s, 1H, 6-H), 6.63 (dd, J= 6.0, 8.2 Hz, 1H, I'- 
ll), 5.41 (ra, 1H, 3'-H), 4.33-4.36 (m, 2H, 5'-H), 4.23 
(m, 1H, 4'-H), 4.09 (s, 3H, 4-OCH 3 ) , 2.78-2.94 (m, 2H, 
one of CH(CH 3 )2 and one of 2'-H), 2.57-2.70 (m, 2H, two 
of CH(CH 3 ) 2 ), 2.50-2.57 (ddd, J = 2.3, 6.0-, 14.2 Hz, 
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one of 2'-H), 1.17-1.24 (m, 18H, CH(CH 3 ) 2 ); 13 C NMR (100 
MHz , CD 3 OD) 5 178.3, 177.8, 177.5, 164.3, 153.3, 152 .8, 
128.6, 105.2, 85.3, 83.3, 75.8, 65.0, 54.4, 51.8, 38.2, 
36.8, 35.2, 35.1, 19.9 (two CH 3 ) , 19.3-19.5 (four peaks 
5 for four CH 3 ) ; HRMS (FAB+) calcd for C24H34O7N4I (M+H + ) : 
617.1472, found: 617.1464. 

2 -Amino- 7- ([3-D-2 ' -deoxyr ibof ur anosy 1 ) -5-iodo-4- 
methoxypyrrolo [2 , 3-d] pyrimi dine (4): 3 (1.98 g; 3.21 

10 mmol) was dissolved in 0.5 M methanolic CH 3 ONa (50 mL) 
and stirred at 65 °C for 12 h. Saturated aqueous NaHC0 3 
(20 mL) was added and the mixture was stirred for 10 
min. Then most of methanol was evaporated and the 
residue was extracted by ethyl acetate (150 mL) . The 

15 organic layer was washed by saturated aqueous NaHC0 3 
and NaCl respectively, and dried over anhydrous Na 2 S0 4 . 
After evaporation of the solvent, the residue was 
purified by flash column chromatography over silica gel 
using CH 3 OH-CH 2 Cl 2 (1:30-15) as the eluent to afford 4 

20 as white solid (1.23 g; 94% yield): X H NMR (400 MHz, 
CD 3 OD) 5 7.17 (s, 1H, 6-H) , 6.36 (dd, J = 6.0, 8.4 Hz, 
1H, l'-H), 4.47 (m, 1H, 3'-H), 3.99 (s, 3H, 4-OCH 3 ) , 
3.96 (m, 1H, 4'-H), 3.77 (dd, J — 3.4, 12.0 Hz, 1H, one 
of 5'-H), 3.70 (dd, J = 3.7, 12.0 Hz, 1H, one of 5'-H), 

25 2.55-2.64 (ddd, J = 6.0, 8.4, 13.4 Hz, one of 2'-H), 
2.20-2.26 (ddd, J- = 2.4, 5.9, 13.4 Hz, one of 2'-H); 
13 C NMR (100 MHz , CD 3 OD) 5 164.7, 160.6, 154.3, 126.5, 
101.6, 88.7, 86.0, 73.0, 63.7, 53.7, 51.3, 41.1; HRMS 
(FAB+) calcd for Ci2Hi 6 0 4 N 4 I (M+H + ) : 407.0216, found: 

30 407.0213. 

2 -Amino- 7- [p-D-5 ' -O- ( tert-butyldimethylsilyl) -2' - 
deoxyribof uranosyl ] -5-iodo-4-methoxypyrrolo [2 , 3- 
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d]pyr±midine (5): To a stirred solution of 4 (1.23 g; 
3.02 mmol) and imidazole (4 94 mg; 7.24 ramol) in 
anhydrous DMF (15 mL) was added tert-butyldimethylsilyl 
chloride (TBDMSC1) (545 mg; 3.51 mmol). The reaction 
5 mixture was stirred at room temperature for 20 h. Then 
most solvent was removed under vacuum, and the residue 
was purified by flash column chromatography over silica 
gel using ethyl acetate-hexane (1:2-0.5) as the eluent 
to afford 5 as white foam (1.38 g; 88% yield): 1 H NMR 

10 (400 MHz, CD3OD) 5 7.23 (s, 1H, 6-H) , 6.49 (dd, J = 
6.1, 7.7 Hz, 1H, l'-H), 4.46 (m, 1H, 3'-H), 3.99 (s, 
3H, 4-OCff 3 ), 3.94 (m, 1H, 4'-H), 3.79-3.87 (m, 2H, 5'- 
H), 2.36-2.44 (ddd, J = 5.8, 7.7, 13.3 Hz, one of 2'- 
H), 2.24-2.31 (ddd, J" - 3.1, 6.0, 13.3 Hz, one of 2'- 

15 H), 0.96 (s, 9H, C(CH 3 ) 3 ), 0.14 (s, 3H, one of SiCtf 3 ) , 
0.13 (s, 3H, one of SiCH 3 ) ; 13 C NMR (100 MHz, CD 3 OD) 5 
164.6, 160.7, 154.7, 125.1, 101.0, 88.2, 84.2, 72.7, 
64.7, 53.7, 51.7, 41.9, 26.7, 19.4, -5.0, -5.1; HRMS 
(FAB+) calcd for C 18 H 30 O4N 4 SiI (M+H+) : 521.1081, found: 

20 521.1068. 

7- [P-D-3' -O-Allyl-5 » -O- ( tejrt-butyldimethylsilyl) -2' - 
deoxyribof ur ano sy 1 ] - 2 - amino - 5 - iodo - 4 - 

methoxypyrrolo [2 f 3-d] pyrimidine (6): To a stirred 
25 solution of 5 (1.38 g; 2.66 mmol) in CH 2 C1 2 (80 mL) 
were added tetrabutylammonium bromide (TBAB) (4 37 mg; 
1.33 mmol), allyl bromide (1.85 mL, 21.4 mmol) and 40% 
aqueous NaOH solution (40 mL) . The reaction mixture was 
stirred at room temperature for lh. Ethyl acetate (200 
30 mL) was added and the organic layer was separated. The 
aqueous layer was extracted with ethyl acetate (2x50 
mL) . The combined organic layer was washed by saturated 
aqueous NaHC0 3 and NaCl respectively, and dried over 
anhydrous Na 2 S0 4 . After evaporation of the solvent, the 
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residue was purified by flash column chromatography 
over silica gel using ethyl acetate-hexane (1:3) as the 
eluent to afford 6 as white solid (1.37 g; 92% yield): 
X H NMR (400 MHz, CD3OD) 5 7.20 (s, 1H, 6-H) , 6.43 (dd f J 
5 « 6.2, 7.9 Hz, 1H, l'-H) , 5.89-5.99 (m, 1H, CH 2 CH=CH 2 ) , 
5.2 9-5.35 (dm, J = 17.3 Hz, 1H, one of CH 2 CH=Ctf 2 ) , 
5.16-5.21 (dm, J = 10.5 Hz, 1H, one of CH 2 CH==CH 2 ) , 4.24 
(m, 1H, 3'-H), 4.01-4.11 (m, 3H, 4'-H and CH 2 CH=CH 2 ) , 
3.99 (s, 3H, 4-OCH 3 ), 3.76-3.84 (m, 2H, 5'-H), 2.32- 
10 2.44 (m, 2H, 2'-H), 0.95 (s, 9H, C(CH 3 ) 3 ), 0.14 (s, 3H, 
one of S±CH 3 ) , 0.13 (s, 3H, one of SiCH 3 ) ; 13 C NMR (100 
MHz, CDCI3) 8 163.3, 158.6, 153.6, 134.1, 123.7, 116.9, 

* 

100.6, 84.4, 83.0, 79.1, 70.0, 63.6, 53.3, 51.1, 38.1, 
26.1, 18.5, -5.1, -5.3; HRMS (FAB+) calcd for 
15 C21H34O4N4SH (M+H + ) : 561.1394, found: 561.1390. 



7- [p-D-3' -O-Allyl-5 ' -O- ( tert-butyldimethylsilyl) -2' - 
deoxyribofuranosyl] -2-amino-5- [3- 
[ (trifluoroacetyl) amino] -prop-l-ynyl] -4- 

20 methoxypyrrolo[2 , 3-d] pyrimi dine (7): To a stirred 
solution of 6 (1.37 g; 2.45 mmol) in anhydrous DMF (11 
mL) were added tetrakis (triphenylphosphine) palladium( 0) 
(286 mg; 0.245 mmol) and Cul (101 mg; 0.532 mmol). The 
solution was stirred at room temperature for 10 min. 

25 Then N-propargyltrif luoroacetamide (1.12 g; 7.43 mmol) 
and triethylamine (0.68 mL; 4.90 mmol) were added. The 
reaction was stirred at room temperature for 13 h with 
exclusion of air and light. Most DMF was removed under 
vacuum and the residue was dissolved in ethyl acetate 

30 (100 mL) . The solution was washed by saturated aqueous 
NaHC03 and NaCl respectively, and dried over anhydrous 
Na 2 S0 4 . After evaporation of the solvent, the residue 
was purified by flash column chromatography over silica 
gel using ethyl acetate-hexane (1:3-1.5) and CH 3 OH- 
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CH 2 C1 2 (1:30) respectively as the eluent to afford 7 as 

yellow solid (1.34 g; 94% yield): X H NMR (400 MHz, 

CD3OD) 5 7.34 (s, 1H, 6-H) , 6.42 (dd, J = 6.2, 7.7 Hz, 

1H, l'-H), 5.88-5.99 (m, 1H, CH 2 Ctf=CH 2 ) , 5.28-5.35 (dm, 
5 lJ = 17.3 Hz, 1H, one of CH 2 CH=CH 2 ) , 5.16-5.21 (dm, J = 

10.5 Hz, 1H, one of CH 2 CH=Ctf 2 ) , 4.29 (s, 2H, C=CCH 2 ) , 
4.24 (m, 1H, 3'-H), 4.00-4.09 (m, 3H, 4'-H and 
Ctf 2 CH=CH 2 ) , 3.98 (s, 3H, 4-OCH 3 ) , 3.76-3.84 (m, 2H, 5'- 
H) , 2.32-2.45 (m, 2H, 2'-H), 0.94 (s, 9H, C(Ctf 3 ) 3 ), 
10 0.12 (s, 3H, one of SiCH 3 ) , 0.11 (s, 3H, one of SiCH 3 ) ; 

13 C NMR (100 MHz, CD3OD) 5 165.0, 161.2, 158.1 (q, J = 
36 Hz, COCF3) , 154.2, 135.6, 125.0, 117.2 (q, J = 284 
Hz, COCF3) , 117.0, 99.2, 97.3, 86.0, 84.6, 84.5, 80.3, 
78.0, 71.0, 64.8, 53.8, 39.0, 30.9, 26.5, 19.3, -5.1, - 
15 5.2; HRMS (FAB+) calculated for C 26 H 3 705N5F 3 Si (M+H + ) : 
584.2516, found: 584.2491. 

* 

3' -0-Allyl-7-deaza-7- [3- [ ( tr if luoroace tyl ) amino] -prop- 
1-ynyl] -2 1 -deoxyguanosine (8): To a stirred solution of 

20 7 (1.34 g; 2.30 mmol) in anhydrous CH 3 CN (86 mL) were 
added Nal (363 mg; 2.42 mmol) and chlorotrimethylsilane 
(TMSC1) (0.306 mL; 2.42 mmol). The reaction was stirred 
at room temperature for 1 h and then at 50 °C for 12 h. 
The solvent was evaporated and the residue was 

25 dissolved in anhydrous THF (7 6 mL) . 1 M 
tetrabutylammonium fluoride (TBAF) in THF solution 
(4.80 mL; 4.80 mmol) was added and the reaction was 
stirred at room temperature for 1 h. The solvent was 
evaporated and the residue was dissolved in ethyl 

30 acetate (150 mL) . The solution was washed by saturated 
aqueous NaCl and dried over anhydrous Na 2 S0 4 . After 
evaporation of the solvent, the residue was purified by 
flash column chromatography over silica gel using 
CH 3 OH-ethyl acetate (1:30) as the eluent to afford 8 as 
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yellow solid (356 mg; 34% yield) : X H NMR (400 MHz, 
CD 3 OD) 8 7.21 (s, 1H, 6-H), 6.30 (dd, J= 6.0, 8.4 Hz, 
1H, l'-H), 5.88-5.99 (m, 1H, CH 2 Ctf=CH 2 ) , 5.28-5.35 (dm, 
j = 17.3 Hz, 1H, one of CH 2 CH=CH 2 ) , 5.15-5.20 (dm, J = 
5 10.5 Hz, 1H, one of CH 2 CH=CH 2 ) , 4.29 (s, 2H, C=CCff 2 ) , 
4.23 (m, 1H, 3'-H), 4.00-4.10 (m, 3H, 4'-H and 
CH 2 CH=CH 2 ), 3.65-3.75 (m, 2H, 5'-H), 2.41-2.4 9 (ddd, J 
= 5.8, 8.4, 13.6 Hz, 1H, one of 2'-H), 2.34-2.40 (ddd, 
j — 2.3, 6.0, 13.6 Hz, 1H, one of 2'-H); 13 C NMR (100 
10 MHz, CD3OD) 5 160.9, 158.0 (q, J — 36 Hz, COCF 3 ) , 154.1, 
151. 8, 135.6, 124.4, 117.2 (q, J = 284 Hz, COCF3) , 
117.0, 101.4, 99.7, 86.4, 85.5, 84.8, 80.7, 78.0, 71.0, 
63.7, 38.5, 31.2; HRMS (FAB+) calcd for Ci9H 2 i0 5 N 5 F3 
(M+H + ) : 456.1495, found: 456.1493. 

15 

3' -0-Allyl-7-deaza-7- ( 3-aminoprop-l-ynyl) -2 ' - 
deoxyguanosine-5' - triphosphate (9): The procedure is 
the same as that of preparing 3' -O-allyl-5- (3- 
aminoprop-l-ynyl) -2 1 -deoxyuridine-5' -triphosphate in 

20 Ref. 34a to yield 9 as colorless syrup: X H NMR (300 
MHz, D z O) 5 7.56 (s, 1H, 6-H), 6.37 (t, J — 7.3 Hz, 1H, 
l'-H), 5.89-6.02 (m, 1H, CH 2 Ctf=CH 2 ) , 5.31-5.39 (dm, J = 
17.3 Hz, 1H, one of CH 2 CH=CH 2 ) , 5.21-5.28 (dm, J — 10.5 
Hz, 1H, one of CH 2 CH=CH 2 ) , 4.49 (s, 2H, CsCCH 2 ) , 4.32 

25 (m, 1H, 3'-H), 4.06-4.18 (m, 3H, 4'-H and CH 2 CH=CH 2 ) , 
3.92-3.99 (m, 2H, 5'-H), 2,44-2.60 (m, 2H, 2'-H); 31 P 
NMR (121.4 MHz, D 2 0) 5 -6.1 (d, J= 20.8 Hz, IP, y-P) , - 
10.8 (d, <J = 18.9Hz, IP, Ct-P), -21.9 (t, J= 19.8 Hz, 

IP, P-P) - 



30 



3' -O-Allyl-dGTP-PC-Bodipy-FL-510 (10): PC-Bodipy-FL-510 
NHS ester (prepared by the same procedure in Ref. 34a) 
(7.2 mg, 12 nmol) in 300 uL of acetonitrile was added 
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to a solution of 9 (2 mg, 3.4 pmol) in 300 uL of 
Na 2 C0 3 -NaHC0 3 aqueous buffer (0-1 M, pH 8.5). The 
reaction mixture was stirred at room temperature for 3 
h. A preparative silica-gel TLC plate was used to 
5 separate the unreacted PC-Bodipy-FL-510 NHS ester from 
the fraction containing 10 with CHCI3-CH3OH (85:15) as 
the eluent . The product was concentrated further under 
vacuum and purified with reverse-phase HPLC on a 150 x 
4 . 6-itim CI 8 column to obtain the pure product 10 

10 (retention time of 34 min) . Mobile phase: A, 8 . 6 mM 
triethylamine/100 mM hexaf luoroisopropyl alcohol in 
water (pH 8.1); B, methanol. Elution was performed with 
100% A isocratic over 10 min, followed by a linear 
gradient of 0-50% B for 20 min and then 50% B isocratic 

15 over another 20 min. 3 T — O— allyl— dGTP— PC— Bodipy— FL-510 
10 was characterized by primer extension reaction and 
MALDI-TOF MS . 



20 2) Synthesis of 3' -O-Allyl-dATP-PC-ROX as shown in Fig. 
8. 

4-Chloro-5-iodopyrrolo[2,3-d]pyrimidine (12) : To a 

vigorously stirred solution of 11 (1.0 g; 6.51 mmol) in 
25 CH 2 C1 2 (55 ml) was added 95% N-iodosuccimide (1.70 g; 
7.18 mmol). The reaction mixture was stirred at room 
temperature for 1 h, during which time more precipitate 
appeared. The solid was filtered and recrystallized in 
hot methanol to afford 12 as slightly grey crystals 

30 (1.49 g; 82% yield): l H NMR (400 MHz, DMSO-d6) 8 12.96 
(s br, 1H, Ntf) , 8.59 (s, 1H, 2-H) , 7.94 (s, 1H, 6-H) ; 
13 C NMR (100 MHz, DMSO-d6) 8 151.2, 150.4, 150.2, 133.6, 
115.5, 51.7; HRMS (FAB+) calod for C 6 H 4 N3C1I (M+H + ) : 
279.9139, found: 279.9141. 
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10 



4-Chloro-7- O-D-2' -deoxyr ibof uranosyl ) -5- 
iodopyrrolo[2,3-d]pyrimidine (13) : To a stirred 
solution of 12 (597 mg; 2.14 mmol) in CH 3 CN (36 mL) 
5 were added KOH powder (0.30 g; 5.36 mmol) and tris [2- 
(2-methoxyethoxy) ethyl] amine (44 uL, 0.14 mmol). The 
mixture was stirred at room temperature for 10 min and 
then 90% 3,5-di-0-(p-toluyl)-2-deoxy-D-ribofuranosyl 
chloride (1.00 g; 2.31 mmol) was added. The reaction 
was stirred vigorously at room temperature for 1 h, and 
the insoluble material was filtered and washed by hot 
acetone. The combined solution was evaporated and 
dissolved in 7M methanolic ammonia (72 mL') . The 
solution was stirred at room temperature for 24 h. 
After evaporation of the. solvent, the residue was 
purified by flash column chromatography over silica gel 
using CH 3 OH-CH 2 Cl 2 (0-1:20) as the eluent to afford 13 
as white solid (711 mg; 84% yield): 2 H NMR (400 MHz. 
CD3OD) 8 8.57 (s, 1H, 2-H), 8.08 (s, 1H, 6-H) , 6.72 
(dd, J = 6.3, 7.5 Hz, 1H, l'-H), 4.53 (m, 1H, 3'-H), 
4.00 (m, 1H, 4'-H), 3.80 (dd, J= 3.6/ 12.0 Hz, 1H, one 
of 5'-H), 3.74 (dd, J = 3.6, 12.0 Hz, 1H, one of 5'-H), 
2.56-2.64 (ddd, J = 6.1, 7.5, 13.5 Hz, 1H, one of 2'- 
H), 2.36-2.43 (ddd, J= 3.3, 6.2, 13.5 Hz, 1H, one of 
25 2'-H); 13 C NMR (100 MHz, CD3OD) S 152.9, 151.7, 151.3, 
134.7, 118.5, 89.0, 85.7, 72.6, 63.2, 52.6, 41.7; HRMS 

(FAB+) calcd for CnH^OaNaClI . (M+H + ) : 395.9612, found: 
395 . 9607 . 



15 



20 



30 7- [p-D-5' -o- ( ter-fc-Butyldimethylsilyl) -2' - 

deoxyribof uranosyl] -4-chloro-5-iodopyrro-lo [2 , 3- 

d] pyrimidine (14): The procedure is the same as that of 

5 and the crude was purified by flash column 



( f' 
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chromatography over silica gel using ethyl acetate- 
hexane (1:3-2) as the eluent to afford 14 as white 
solid (65% yield) and 30% of the starting material 13: 

X H NMR (400 MHz, CD 3 OD) 5 8.56 (s, 1H, 2-H) , 7.99 (s, 

5 1H, 6-H), 6.73 (t, J = 6.7 Hz, 1H, I'-H), 4.52 (m, 1H, 

3'-H), 4.02 (m, 1H, 4'-H), 3.92 (dd, J= 3.0, 11.4 Hz, 

1H, one of 5'-H), 3.86 (dd, J « 3.1, 11.4 Hz, 1H, one 

of 5'-H), 2.47-2.55 (ddd, J= 5.8, 7.1, 13.4 Hz, 1H, 

one of 2'-H), 2.40-2.47 (ddd, l7= 3.6, 6.3, 13.4 Hz, 

10 1H, one of 2'-H), 0.94 (s, 9H, C(CH 3 ) 3 )/ 0.14 (s, 3H, 

one of . SiCff 3 ) , 0.13 (s, 3H, one of SiCH 3 ) ; 13 C NMR (100 

MHz, CD3OD) 5 152.8, 151.5, 151.3, 133.8, 118.2, 88.9, 

« 

85.4, 72.5, 64.6, 52.6, 42.4, 26.7, 19.5, -4.9, -5.0; 
HRMS (FAB+) calcd for Ci 7 H 2 60 3 N 3 ClSiI (M+H + ) : 510.0477, 
15 found: 510.0487. 



7- [p-D-3' -O- Ally 1-5' -O- ( tert-butyl dimethyl si lyl) -2' - 
deoxyribof uranosyl ] -4-chloro-5-iodopyrrolo [2,3- 
d] pyrimidine (15) : The procedure is the same as that of 
2 0 6 and the crude was purified by flash column 
chromatography over silica gel using ethyl acetate- 
hexane (1:6) as the eluent to afford 15 as yellow oil 

(752 mg; 95% yield): 1 H NMR (400 MHz, CD 3 OD) 5 8.52 (s, 
1H, 2-H), 7.94 (s, 1H, 6-H), 6.64 (dd, J = 6.1, 7.6 Hz, 

25 1H, l'-H), 5.88-5.99 (m, 1H, CH 2 Cff=CH 2 ) , 5.28-5.34 (dm, 
J = 17.3 Hz, 1H, one of CH 2 CH=Ci* 2 ) , 5.16-5.21 (dm, J = 
10.4 Hz, 1H, one of CH 2 CH=CH 2 ) , 4.28 (m, 1H, 3'-H), 
4.13 (m, 1H, 4'-H), 4.01-4.11 (m, 2H, CH 2 CH=CH 2 ) , 3.88 
(dd, J = 3.6, 11.2 Hz, 1H, one of 5'-H), 3.80 (dd, J — 

30 3.1, 11.3 Hz, 1H, one of 5'-H), 2.51-2.57 (ddd, J" = 
2.7, 6.0, 13.5 Hz, 1H, one of 2'-H), 2.42-2.50 (ddd, J 
= 5.7, 7.7, 13.5 Hz, 1H, one of 2'-H), 0.93 (s, 9H, 
C(CH 3 ) 3 ), 0.13 (s, 3H, one of SiCH 3 ) , 0.12 (s, 3H, one 



( 
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of SiCJf 3 ) ; 13 C NMR (100 MHz, CD 3 OD) 8 152.8, 151.4, 
151.3, 135.5, 133.6, 118.2, 117.2, 86.5, 85.6, 80.2, 
71.0, 64.8, 52.8, 39.7, 26.7, 19.4, -4.8, -5.0; HRMS 
(FAB+) calcd for C20H30O3N3CISH (M+H + ) : 550.0790, found: 
5 550.0773. 

3, -o-Allyl-7-deaza-7-iodo-2' -deoxyadenosine (16): To a 

stirred solution of 15 (752 mg; 1.37 mmol) in anhydrous 
THF (32 mL) was added 1 M TBAF in THF solution (1.50 

10 mL; 1.50 mmol) and the reaction was stirred at room 
temperature for 1 h. The solvent was evaporated and the 
residue was dissolved in 7 M methanolic ammonia (22 
mL) . The solution was stirred in an autoclave at 115- 
12 0 °C for 17 h. After evaporation of the solvent, the 

15 residue was purified by flash column chromatography 
over silica gel using CH 3 OH-CH 2 Cl 2 (1:20) as the eluent 
to afford 16 as white solid (479 mg; 84% yield): X H NMR 
(400 MHz, CD3OD) 8 8.08 (s, 1H, 2-H) , 7.56 (s, 1H, 6- 
H) , 6.45 (dd, J= 5.8, 8.6 Hz, 1H, l'-H), 5.90-6.00 (m, 

20 1H, CH 2 CH=CH 2 ), 5.29-5.35 (dm, J = 17.2 Hz, 1H, one of 
CH 2 CH=Ctf 2 ), 5.16-5.21 (dm, J = 10.5 Hz, 1H, one of 
CH 2 CH=CHz) , 4.28 (m, 1H, 3'-H), 4.12 (m, 1H, 4'-H), 
4.02-4.12 (m, 2H, CH 2 CH=CH 2 ) , 3.7 8 (dd, J = 3.7, 12.1 
Hz, 1H, one of 5'-H), 3.70 (dd, J = 3.6, 12.1 Hz, 1H, 

25 one of 5'-H), 2.53-2.61 (ddd, j — 5.8, 8.6, 13.6 Hz, 
1H, one of 2'-H), 2.41-2.47 (ddd,' J = 2.0, 5.8, 13.5 
Hz, 1H, one of 2'-H); 13 C NMR (100 MHz, CD3OD) 8 158.5, 
152.3, 150.3, 135.7, 128.8, 117.0, 105.3, 86.8, 86.4, 
80.7, 71.0, 63.7, 51.3, 38.8; HRMS (FAB+) calcd for 

30 C14H18O3N4I (M+H + ) : 417.0424, found: 417.0438. 

3' -0-Allyl-7-deaza-7- [3- [ (trif luoroacetyl) amino] -prop- 
1-ynyl] -2' -deoxyadenosine (17): The procedure is the 
same as that of 7 and the crude product was purified by 
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flash column chromatography over silica gel using ethyl 
acetate-hexane (1:1-0) as the eluent to afford 17 as 
yellow solid (455 mg; 90% yield) : l H NMR (400 MHz, 
CD 3 OD) 5 8.08 (s, 1H, 2-H) , 7.60 (s, 1H, 6-H) , 6.41 
(dd, J = 5.8, 8.6 Hz, 1H, l'-H), 5.89-6.00 (m, 1H, 
CH 2 Ctf=CH 2 ) , 5.29-5.35 (dm, J = 17.3 Hz, 1H, one of 
CH 2 CH=Ctf 2 ) , 5.16-5.21 (dm, J = 10.4 Hz, 1H, one of 
CH 2 CH=Ctf 2 ) , 4.31 (s, 2H, C=CCH 2 ) , 4.2 9 (m, 1H, 3'-H), 
4.13 (m, 1H, 4'-H), 4.01-4.11 (m, 2H, CH 2 CH=CH 2 ) , 3.79 
(dd, J = 3.6, 12.1 Hz, 1H, one of 5'-H), 3.71 (dd, J = 
3.5, 12.1 Hz, 1H, one of 5'-H), 2.54-2.62 (ddd, J = 
5.8, 8.6, 13.6 Hz, 1H, one of 2'-H), 2.42-2.48 (ddd, J 
= 1.9, 5.8, 13.6 Hz, 1H, one of 2'-H); 13 C NMR (100 
MHz, CD3OD) 8 158.8, 158.6 (q, J = 38 Hz, COCF3) , 152.9, 
149.6, 135.6, 128.1, 117.1 (q, J - 284 Hz, COCF 3 ) , 
117.0, 104.5, 96.3, 87.3, 86.9, 86.8, 80.7, 77.0, 71.0, 
63.8, 38.7, 31.1; HRMS (FAB+) calcd for Ci9H 2 i0 4 N 5 F 3 
(M+H + ) : 440.1546, found: 440.1544. 

ft 

20 3' -0-Allyl-7-deaza-7- ( 3 - aminopr op- 1 -y ny 1 ) -2' - 

deoxyadenosine-5' -triphosphate (18): The procedure is 
the same as that of preparing 9 to yield 17 as 
colorless syrup: X H NMR (300 MHz, D 2 0) 8 8.02 (s, 1H, 2- 
H) , 7.89 (s, 1H, 6-H), 6.54 (t, J = 6.6 Hz, 1H, l'-H), 
25 5.89-6.02 (m, 1H, CH 2 CH=CH 2 ) , 5.30-5.39 (dm, J = 17.3 
Hz, 1H, one of CH 2 CH=CH 2 ) , 5.20-5.27 (dm, J = 10.4 Hz, 
1H, one of CH 2 CH=CH 2 ) , 4.48 (s, 2H, C=CCH Z ) , 4.35 (m, 
1H, 3'-H), 4.05-4.17 (m, 4H, Ctf 2 CH=CH 2 and 5'-H), 3.99 
(m, 1H, 4'-H), 2.50-2.59 (m, 2H, 2'-H); 31 P NMR (121.4 
30 MHz, D 2 0) 8 -6.1 (d, J= 21.1 Hz, IP, y-P) , -10.8 (d, J 
m 18.8 Hz, IP, oc-P) , -21.9 (t, J= 19.9 Hz, IP, p-P) . 
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3* -o-allyl-dATP-ROX (19): The coupling reaction of 18 
with PC-ROX-NHS ester (Ref. 2b) afforded 19, following 
a similar procedure as the preparation of 10. 3 1 -0- 
allyl-dATP-PC-ROX 19 was characterized by the primer 
5 extension reaction and MALDI-TOF MS. 



3) Synthesis of 3' -O-Allyl-dCTP-PC-Bodipy-650 as shown 
in Fig. 9. 

10 

5 / _o- ( teirt-Butyldimethylsilyl) -5-iodo-2 ' -deoxycytidine 
(21) : The procedure is the same as that of 5 and the 
crude product was purified by flash column 
chromatography over silica gel using CH3OH-CH2CI2 (1:20) 

15 as the eluent to afford 21 as white solid (1.18 g; 8 9% 
yield): X H NMR (400 MHz, CD3OD) 5 8.18 (s, 1H, 6-H) , 
6.17 (dd, J = 5.8, 7.5 Hz, 1H, l'-H), 4.34 (m, 1H, 3'- 
H) , 4.04 (m, 1H, 4'-H), 3.93 (dd, J - 2.5, 11.6 Hz, 1H, 
one of 5'-H), 3.84 (dd, J - 2.9, 11.6 Hz, 1H, one of 

20 5'-H), 2.41-2.48 (ddd, l7= 2.5, 5.8, 13.5 Hz, 1H, one 
of 2'-H), 2.01-2.08 (ddd, J= 5.9, 7.6, 13.5 Hz, 1H, 
one of 2'-H), 0.95 (s, 9H, C(CH 3 ) 3 ), °- 17 < s ' 3H ' one 
of SiCH 3 ), 0.16 (s, 3H, one of SiCH 3 ) ; 13 C NMR (100 MHz, 
CD 3 OD) 8 165.5, 156.8, 147.8, 89.4, 88.3, 72.8, 64.6, 

25 57.1, 43.1, 26.7, 19.4, -4.8, -4.9; HRMS (FAB+) calcd 
for Ci 5 H 2 70 4 N 3 SiI (M+H + ) : 468.0816, found: 468.0835. 



3' -O-Allyl-5' -O- (text-butyldimethylsilyl) -5-iodo-2' - 
deoxycytidine (22) : To a stirred solution of 21 (1.18 
30 g; 2.52 mmol) in anhydrous THF (43 itiL) was added 95% 
NaH powder (128 mg; 5.07mmol). The suspension was 
stirred at room temperature for 45 min. Allyl bromide 
(240 ]iL, 2.79 mmol) was then added at 0 °C and the 
reaction was stirred at room temperature for 14 h with 
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exclusion of moisture. Saturated aqueous NaHC0 3 (10 mL) 
was added at 0 °C and stirred for 10 min. Most THF was 
evaporated and the residue was dissolved in ethyl 
acetate (150 mL) . The solution was washed by saturated 
5 aqueous NaHC0 3 and NaCl respectively, and dried over 
anhydrous Na 2 S0 4 . After evaporation of the solvent, the 
residue was purified by flash column chromatography 
over silica gel using ethyl acetate as the eluent to 
afford 22 as white solid (537 mg; 42% yield) : X H NMR 
10 (400 MHz, CD 3 OD) 8 8.15 (s, 1H, 6-H) , 6.12 (dd, J = 
5.6, 8.0 Hz, 1H, l'-H), 4.17 (m, 1H, 4'-H), 4.14 (m, 
1H, 3'-H), 3.98-4.10 (m, 2H, CH 2 CH=CH Z ) , 3.93 (dd, J = 
2.8, 11.5 Hz, 1H, one of 5'-H), 3.83 (dd, J = 2.8, 11.5 
Hz, 1H, one of 5'-H), 2.53-2.60 (ddd, J = 1.7, 5.6, 
15 13.6 Hz, 1H, one of 2'-H), 1.94-2.02 (ddd, J = 5.9, 
8.0, 13.6 Hz, 1H, one of 2'-H), 0.94 (s, 9H, C(CH 3 ) 3 ), 
0.17 (s, 3H, one of SiCH 3 ) , 0.16 (s, 3H, one of SiCff 3 ) ; 
13 C NMR (100 MHz, CD 3 0D) 8 165.4, 156.7, 147.7, 135.5, 
117.2, 88.2, 87.0, 80.4, 70.9, 64.8, 57.3, 40.1, 26.7, 
20 19.4, -4.7, -4.9; HRMS (FAB+) calcd for Ci 8 H 3 i0 4 N 3 SiI 
• (M+H + ) : 508.1129, found: 508.1123. 

3/ -o-allyl-5-iodo-2' -deoxycytidine (23): To a stirred 
solution of 22 (537 mg; 1.06 mmol) in anhydrous THF (25 

25 mL) was added 1 M TBAF in THF solution (1.17 mL; 1.17 
mmol) and the reaction was stirred at room temperature 
for 1 h. The solvent was evaporated and the residue was 
dissolved in ethyl acetate (100 mL) . The solution was 
washed by saturated aqueous NaCl and dried over 

30 anhydrous Na 2 S0 4 . After evaporation of the solvent, the 
residue was purified by flash column chromatography 
over silica gel using CH 3 OH-CH 2 Cl 2 (1:10) as the eluent 
to afford 23 as white crystals (287 mg; 69% yield) : X H 
NMR (400 MHz, CD 3 OD) 8 8.47 (s, 1H, 6-H), 6.15 (dd, J = 
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6-2, 6.7 Hz, 1H, 1<-H), 5.87-5.98 < m , lH/ CH 2 C^CH 2 ) , 
5.26-5.33 (dm, J = 17. 2 Hz, 1H, one of CH 2 CH=CH 2 ) , 
5.14-5.19 (dm, J — 10.5 Hz, 1H, one of CH 2 CH=Cff 2 ) , 4.18 
<», 1H, 3'-H), 4.08 (m, 1H, 4'-H), 3.98-4.10 (m, 2H 
Ctf 2 CH=CH 2 ), 3.82 (dd, J= 3.2, 13.0 Hz, 1H, one of 5'- 
H), 3.72 (dd, J= 3.3, 13.0 Hz, 1H, one of 5'-H), 2.44- 
2.51 (ddd, J = 3.2, 6.0, 13.6 Hz, 1 H , one of 2'-H), 
2.07-2.15 (m, 1H, one of 2'-H); » C NMR (100 MHz, CD3OD) 
5 165.4, 156.9, .148.8, 135.6, 117.0, 87.9, 86.9, 79.6, 
71-2, 62.7, 57.2, 39.7; HRMS ( FAB+ ) calcd for C 12 H 17 0 4 N 3 i 
(M+H + ) : 394. 0264, found: 394.0274. 



3' -O-Allyl-5- [3- [ (trifluoroacetyl) amino] -prop-l-ynyl] - 
2 ' -deoxycy tidine (24): The procedure is the same as 
15 that of 7 and the crude product was purified by flash 
column chromatography over silica gel using CH 3 OH-CH 2 Cl 2 
(0-1:10) as the eluent to afford 24 as yellow crystals 
(252 mg; 83% yield): *H NMR (400 MHz, CD3OD) 8 8.31 (s 
1H, 6-H), 6.17 (dd, J- 6.0, 7.3 Hz, 1H, l'-H), 5.87- 
20 5.97 (m, 1H, CH 2 CH=CH 2 , , 5.26-5.33 (dm, J - 17.3 Hz 
1H, one of CH 2 CH=Ctf 2 ) , 5.15-5.19 (dm, J - 10.4 Hz, in, 
one of CH 2 CH=Ctf 2 ), 4.31 (s, 2H, C-CCtf.) , 4.17 (m, m, 
3'-H), 4.09 (m, 1H, 4'-H), 3.98-4. 10 ' (m, 2H 
Ctf 2 CH=CH 2 ), 3.80 (dd, jr- 3.4, 12.0 Hz, 1H, one of 5'- 
25 H), 3.72 (dd, j- 3.6, 12.0 Hz, 1H, one of 5'-H), 2.46- 
2.53 (ddd, J = 2.9, 5.3, 13.6 Hz, 1 H , one of 2'-H), 
2.04-2.12 (rn, 1 H/ one of 2'-H); 13 C NMR (100 MHz, CD3OD) 
6 166.0, 158.4 (q, j- 38 Hz, COCF3) , 156.3, 145.8, 
135.6, 117.1 (q, j- 284 Hz, COCF 3 ) , H 7 . 0, 91.9, 90.7, 
0 88.0, 87.0, 79.8, 75.5, 71.2, 62.8, 39.6, 31.0,-HRMS 
(FAB+) calcd for C 17 H 20 O 5 N 4 F 3 (M+H + ) : 417.1386, found: 

1 1 ""7 "1 *"i *~j •■"» 



417.1377 
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3' -O-Allyl-5- (3-aminoprop-l-ynyl) -2' -deoxycytidine 5' 

■ 

triphosphate (25) : The procedure is the same as that of 
preparing 9 to yield 25 as colorless syrup: X H NMR (30 0 
MHz, D z O) 5 8.43 (s, 1H, 6-H) , 6.21 (t, J = 6.7 Hz, 1H, 
5 l'-H), 5.85-6.00 (m, 1H, CH 2 Cff=CH 2 ) , 5.28-5.38 (dm, J = 
17.3 Hz, 1H, one of CH 2 CH=Ctf 2 ) , 5.19-5.27 (dm, J = 10.4 
Hz, 1H, one of CH 2 CH=CH 2 ) , 4.22-4.41 (m, 3H, 3'-H and 
C=CCH 2 ) , 4.05-4.18 (m, 3H, 4'-H and Ctf 2 CH=CH 2 ) , 3.94- 
4.01 (m, 2H, 5'-H), 2.47-2.59 (m, 1H, one of 2'-H), 
2.20-2.32 (m, 1H, one of 2'-H); 31 P NMR (121.4 MHz, D 2 0) 
8 -7.1 (d, J = 19.8 Hz, IP, y-P) , -11-1 (d, J = 19.1 
Hz, IP, a-P) , -21.9 (t, J= 19.5 Hz, IP, (3-P) . 

3'O-allyl-dCTP-PC-Bodipy-650 (26) : The coupling 

15 reaction of 25 with PC-Bodipy-650-NHS ester (Ref. 34b) 
afforded 26, following a similar procedure as the 
preparation of 10. 3 ' -O-allyl-dCTP-PC-Bodipy-650 26 was 
characterized by the primer extension reaction and 
MALDI-TOF MS . 



4) Synthesis of 3' -0-allyl-dUTP-PC-R6G as shown in 
Figure 10 . 



5 r _ 0 - (terfc-butyldimethylsilyl) -5-iodo-2' -deoxyuridine 
(28) : The procedure is the same as that of 5 and the 
crude product was purified by flash column 
chromatography over silica gel using CH 3 OH-CH 2 Cl 2 (1:20) 
as the eluent to afford 28 as white solid (1.18 g; 8 9% 
yield): X H NMR (400 MHz, CD3OD) 6 8.17 (s, 1H, 6-H), 
6.21 (dd, J = 5.9, 7.9 Hz, 1H, l'-H), 4.36 (m, 1H, 3'- 
H) , 4.02 (m, 1H, 4'-H), 3.93 (dd, J = 2.4, 11.5 Hz, 1H, 
one of 5'-H), 3.85 (dd, J = 2.9, 11.5 Hz, 1H, one of 
5'-H), 2.30-2.37 (ddd, J = 2.3, 5.8, 13.4 Hz, 1H, one 
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of 2'-H), 2.08-2.15 (ddd, J*= 5.9, 7.9, 13.4 Hz, 1H, 
one of 2'-H), 0.96 (s, 9H, C(CH 3 ) 3 ), 0.19 (s, 3H, one 
of SiCH 3 ) , 0.17 (s, 3H, one of SiCH 3 ) - 13 C NMR (100 MHz, 

CD 3 OD) 8.162.4, 151.5, 145.8, 89.3, 87.2, 72.8, 68.7, 
5 64.6, 42.3, 26.8, 19.5, -4.8, -4.9. HRMS (FAB+) Calcd 
for Ci 5 H2605N 2 SiI (M+H + ) : 469.0656, found: 469.0672. 



3' -O-allyl-5' -O- ( fceir t-butyl dime thy lsilyl) -5-iodo-2' - 
deoxyuridine (29) : The procedure is the same as that of 
10 22 and the crude product was purified by flash column 
chromatography over silica gel using ethyl acetate- 
hexane (1:2.5) as the eluent to afford 29 as white 

solid (1.03 g; 80% yield). X H NMR (400 MHz, CD3OD) 5 
8.15 (s, 1H, 6-H) , 6.15 (dd, J= 5.6, 8.3 Hz, 1H, 1' - 
15 H) , 5.87-5.97 (m, 1H, CH 2 CJf-CH 2 ) , 5.27-5.33 (dm, J - 
17.3 Hz, 1H, one of CH 2 CH==Ctf 2 ) / 5.16-5.21 (dm, J = 10.4 
Hz, 1H, one of CH 2 CH==CH 2 ) , 4.13-4.18 (m, 2H, 3'-H and 
4'-H), 3.99-4.10 (m, 2H, CH 2 CH=CH 2 ) , 3.92 (dd, J = 2.7, 

11.5 Hz, 1H, one of 5'-H), 3.84 (dd, J — 2.7, 11.5 Hz, 
20 1H, one of 5'-H), 2.43-2.49 (ddd, J - 1.7, 5.6, 13.6 

Hz, 1H, one of 2'-H), 2.02-2.10 (ddd, J" = 5.6, 8.4, 

13.6 Hz, 1H, one of 2'-H), 0.96 (s, 9H, C(CH 3 ) 3 ), 0 ■ 18 
(s, 3H, one of SiCH 3 ) , 0.17 (s, 3H, one of SiCH 3 ) . 13 C 

NMR (100 MHz, CD 3 OD) 5 162.3, 151.4, 145.5, 135.5, 
25 117.2, 87.0, 86.8, 80.3, 70.9, 69.0, 64.8, 39.4, 26.8, 
19.4, -4.7, -4.8. HRMS (FAB+) Calcd for Ci 8 H 3 o0 5 N 2 SiI 
(M+H + ) : 509.0969, found: 509.0970. 

3' -O-allyl-5' -O- ( tex-t-butyldimethylsilyl) -5- [3- 
30 [ (trifluoroacetyl) amino] -prop- 1 -yny 1 ] -2' -deoxyuridine 

(30) : The procedure is the same as that of 7 and the 
crude product was purified by flash column 
chromatography over silica gel using CH 3 0H-CH 2 C1 2 

(0-1:40) as the eluent to afford 30 as yellow crystals 
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(786 mg; 73% yield). X H NMR (400 MHz, CD 3 OD) 5 8.11 ( S/ 
1H, 6-H), 6.18 (dd, J = 5.8, 7.9 Hz, 1H, l'-H), 5.87- 
5.97 (m, 1H, CH 2 CH=CH 2 ), 5.27-5.33 (dm, J — 17. 2 Hz, 
1H, one of CH 2 CH=CH 2 ) , 5. 16-5/21 (dm, J = 10.4 Hz, 1H, 
5 one of CH 2 CH=Ctf 2 ) , 4.27-4.32 (dd, J - 17.7 Hz, 1H, one 
of C^CCife), 4.21-4.27 (dd, J= 17.7 Hz, 1H, one of 
CsCCH 2 ), 4.14-4.18 (m, 2H, 3'-H and 4'-H), 3.98-4.10 
(m, 2H, C# 2 CH=CH 2 >, 3.93 (dd, J = 2.4, 11.5 Hz, 1H, one 
of 5'-H), 3.84 (dd, J= 2.2, 11.5 Hz, 1H, one of 5' -H) , 
2.44-2.50 (ddd, J = 1.8, 5.7, 13.5 Hz, 1H, one of 2'- 
H), 2.04-2.12 (ddd, J = 5.6, 8.0, 13.5 Hz, 1H, one of 
2'-H), 0.94 (s, 9H, C(CH 3 ) 3 ), 0.16 (s, 3H, one of 
SiCff 3 ) , 0.15 (s, 3H, one of SiCff 3 ) . 13 C NMR (100 MHz, 
CD 3 0D) 5 164.1, 158.0 (q., <J = 37 Hz, COCF 3 ) , 150.6, 
144.3, 135.5, 117.3, 117.1 (q,. j = 2 84 Hz, COCF 3 ) , 
99.5, 88.9, 87.2, 86.9, 80.3, 76.0, 71.0, 64.7, 39.6, 
30.7, 26.6, 19.3, -5.0, -5.2. HRMS (FAB+) m/z: anal. 

Calcd for C 23 H 33 0 6 N 3 F 3 Si (M+H + ) : 532.2091, found: 
532. 2074 . 

3' -O-allyl-5- [3- [ (trif luoroacetyl) amino] -prop-l-ynyl ] - 
2 ' -deoxyuridine (31): The procedure is the same as that 
of 23 and the crude product was purified by flash 
column chromatography over silica gel using ethyl 
acetate-hexane (3:1) as the eluent to afford 31 as 
yellow solid (302 mg; 49% yield). *H NMR (400 MHz, 
CD 3 OD) S 8.29 (s, 1H, 6-H), 6.19 (dd, J = 6.1, 7.4 Hz, 
1H, l'-H), 5.87-5.99 (m, 1H, CH 2 CH=CH 2 ) , 5.27-5.33 (dm, 
J = 17.2 Hz, 1H, one of CH 2 CH=CH 2 ) , 5.15-5.20 (dm, J _ 
10.4 Hz, 1H, one of CH 2 CH=CH 2 ) , 4.27 (s, 2H, C S CCtf 2 ) , 
4.20 (m, 1H, 3'-H), 3.99-4.09 (m, 3H, 4'-H and 
CH 2 CH=CH 2 ), 3.80 (dd, J= 3.3, 12.0 Hz, 1H, one of 5'- 
H), 3.72 (dd, J = 3.4, 12.0 Hz, 1H, one of 5'-H), 2.39- 
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2.46 (ddd, J - 2.6, 5.9, 13.7 Hz, 1H, one of 2'-H), 
2.14-2.22 (ddd, J = 6.3, 7.5, 13.7 Hz, 1H, one of 2'- 
H) . 13 C NMR (100 MHz, CD 3 OD) 5 164.2, 158.0 (q, J"= 38 
Hz, COCF3), 150.8, 145.3, 135.6, 117.2 (q, J — 285 Hz, 
5 COCF3), 117.1, 99.5, 88.3, 87.1, 87.0, 79.9, 75.9, 
71.2, 62.9, 39.0, 30.8. HRMS (FAB+) Calcd for 
C17H19O6N3F3 (M+H + ) : 418.1226, found: 418.1213. 

3' -O-allyl-5- (3-aminoprop-l-ynyl) -2' -deoxyuridine-5' - 
10 triphosphate (32) : The procedure is the same as that of 

preparing 9 to yield 32 as colorless syrup: X H NMR (300 
• MHz, D 2 0) 5 8.31 (s, 1H) , 6.17 (t, 1H) , 5.81-5.90 (m, 

1H), 5.18 (d, 1H), 5.14 (d, 1H), 4.34 (m, 2H) , 4.03- 

4.15 (m, 2H) , 4.00 (d, 2H) , 3.93 (s, 2H) , 2.44-2.47 (m, 
15 1H), 2.22-2.24 (m, 1H) . 31 P NMR (121.4 MHz, D 2 0) 5 -5.90 

(d, J = 19.0 Hz, IP, y-P ), -11.43 (d, J = 20.0 Hz, IP, 

a-P) , -22.25 (t, J = 19.8Hz, IP, (3-P )- 

3' -0-allyl-dUTP-PC-R6G (33): The coupling reaction of 
20 32 with PC-R6G-NHS ester (Ref. 34b) afforded 33, 
following a similar procedure as the preparation of 10. 
3 » -o-allyl-dCTP-PC-Bodipy-650 33 was characterized by 
the primer extension reaction and MALDI-TOF MS. 

25 

II. 3'-0-allyl Modified Photocleavable Fluorescent 
Nucleotides as Reversible Terminators for Primer 
Extension Reactions . 

30 1) Primer Extension by Using 3' -O-Allyl-dGTP-PC-Bodipy- 
FIj-510 (10) and Photocleavage of the Extension Product 

34 . The polymerase extension reaction mixture consisted 
of 60 pmol of primer (5' -GTTGATGTACACATTGTCAA-3 ' ) (SEQ 
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ID NO: 4), 80 pmol of 100-mer template (5'- 
TACCCGGAGGCCAAGTACGGCGGGTACGTCC- 

TTGACAATGTGTACATCAACATCACCTACCACCATGTCAGTCTCGGTTGGATCCT 
CTATTGTGTCCGGG-3' ) (SEQ ID NO : 5 ) , 120 pmol of 3'-0- 
5 allyl-dGTP-PC-Bodipy-FL-510, IX Thermopol reaction 
buffer (20 mM Tris-HCl/10 mM (NH 4 ) 2 SO 4 /10 mM KC1/2 mM - 
MgSO 4 /0.1 % Triton X-100, pH 8.8, New England Biolabs), 
and 6 units of 9°N Polymerase (exo- ) A485L/Y409V in a 
total volume of 20 jal . The reaction consisted of 20 

10 cycles at 94 °C for 20 see, 46 °C for 40 sec, and 60 °C 
for 90 sec. After the reaction, a small portion of the 
DNA extension product was desalted by using ZipTip and 
analyzed by MALDI-TOF MS, which shows a dominant peak 
at m/z 7,052 corresponding to the DNA product 34. The 

15 rest of the product mixture was f reeze-dried, 
resuspended in 200 \il of deionized water, and 
irradiated at 355 nm for 10 sec to cleave the 
fluorophore from the DNA to yield product 35 and then 
analyzed by MALDI-TOF MS. 

20 

Deallyation of photocleaved DNA extension product 35 . 

DNA product 35 (20 pmol) was added to a mixture of 
degassed IX Thermopol reaction buffer (20 mM Tris- 
HCl/10 mM (NH 4 ) 2 SO 4 /10 mM KC1/2 mM MgSO 4 /0.1 % Triton X- 

25 100, pH 8.8, 1 |ul), Na 2 PdCl 4 in degassed H 2 0 (7 23 
nitiol) and P(PhS0 3 Na) 3 in degassed H 2 0 (10 176 nmol) 

to perform deallylation . The reaction mixture was then 
placed in a heating block and incubated at 7 0 °C for 3 0 
seconds to yield quantitatively deallylated DNA product 

30 36 and analyzed by MALDI-TOF MS. 

Primer Extension Reaction Performed with the 
deallylated DNA Product. The deallylated DNA product 36 
was used as a primer in a single-base extension 
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reaction. The 20 ]il reaction mixture consisted of 60 
pmol of the deallylated product 36, 80 pmol of the 100- 
mer template (5' -TACCCGGAGGCCAAGTACGGCGGGTACGTCC- 

TTGACAATGTGTACATCAACATCACCTACCACCATGTCAGTCTCGGTTGGATCCT 

5 CTATTGTGTCCGGG-3 ' ) (SEQ ID NO: 6), 120 pmol of 3' -O- 
allyl-dGTP-PC-Bodipy-FL-510 (10), 6 units of 9°N 
Polymerase (exo-) A485L/Y409V in a total volume of 20 
p.1. The reaction consisted of 20 cycles at 94 °C for 
20 sec, 46 °C for 40 sec, and 60 °C for 90 sec. The DNA 

10 extension product 37 was desalted by using the ZipTip 
protocol, and a small portion was analyzed by using 
MALDI-TOF MS. The remaining product was then irradiated 

* with near-UV light (355 nm) for 10 sec to cleave the 
fluorophore from the extended DNA product. The 

15 resulting photocleavage product 38 was analyzed by 
using MALDI-TOF MS. Finally, deallylation of the 
photocleavage product 38 was performed using a Pd- 
catalyzed deallylation ■ reaction resulting in a 
deallylated DNA product 39, which was then analyzed by 

20 MALDI-TOF MS . 

2) Primer Extension with 3' -O-Allyl-dATP-PC-ROX (19), 
followed by Photocleavage and Deallylation of the 
Extension Product. The polymerase extension reaction 

25 mixture consisted of 60 pmol of primer (5'- 
TAGATGACCCTGCCTTGTCG-3' ) (SEQ ID NO:7), 80 pmol of 100- 
mer template ( 5 ' - GAAGGAGACACGCGGCCAGAGAGGGT- 

CCTGTCCGTGTTTGTGCGTGGAGTTCGACAAGGCAGGGTCATCTAATGGTGATGA 
. GTCCTATCCTTTTCTCTTCGTTCTCCGT-3 9 ) (SEQ ID NO: 8), 12 0 

30 pmol of 3'-0-allyl-dUTP-PC-R6G, 120 pmol of 3'-0-allyl- 
dATP-PC-ROX, 120 pmol of 3' -O-allyl-dGTP-PC-Bodipy-FL- 
510, 120 pmol of 3'-O-allyl-dCTP-PC-Bodipy-650, IX 
Thermopol reaction buffer (20 mM Tris-HCl 

/10 mM (NH 4 ) 2 SO 4 /10 mM KC1/2 mM MgSO 4 /0.1 % Triton X- 
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100, pH 8.8, New England Biolabs) , and 6 units of 9°N 
Polymerase (exo-) A485L/Y409V in a total volume of 20 

juil . The reaction consisted of 20 cycles at 94 °C for 
20 sec, 55 °C for 40 sec, and 68 °C for 90 sec, which 
5 yielded DNA extension product 40. DNA extension product 

mixture was f reeze-dried, resuspended in 200 jal of 
deionized water, and irradiated at 355 nm for 10 sec to 
cleave the fluorophore from the DNA to yield DNA 
product 41 and then analyzed by MALDI-TOF MS. Finally, 
10 deallylation of the photocleavage product was performed 
using a Pd-catalyzed deallylation reaction resulting in 
a deallylated DNA product 42, which was then analyzed 
by MALDI-TOF MS. 

15 3) Primer Extension with 3' -O-Allyl-dCTP-PC-Bodipy-650 
(26) , followed by Photocleavage and Deallylation of the 

i 

Extension Product. The polymerase extension reaction 
mixture consisted of 60 pmol of primer (5'- 
ACACAATAGAGGATCCAACCG AGA-3') (SEQ ID NO: 9), 80 pmol of 

20 100-mer template (5'- TACCCGGAGGCCAAGTACGGCGGGT 
ACGTCCTTGACAATGTGTACATCAACATCACCTACCACCATGTCAGTCTCGGTTG 
GATCCTCTATTGTGTCCGGG-3' ) (SEQ ID NO:10), 120 pmol of 
3' -0-allyl-dUTP-PC-R6G, 120 pmol of 3' -O-allyl-dATP-PC- 
ROX, 120 pmol of 3 ' -O-allyl-dGTP-PC-Bodipy-FL-51 0 , 120 

25 pmol of 3' -O-allyl-dCTP-PC-Bodipy-650, IX Thermopol 
reaction buffer (20 mM Tris- 

HC1/10 mM (NH 4 )2SO 4 /10 mM KC1/2 mM MgSO 4 /0.1 % Triton X- 
100, pH 8.8, New England Biolabs), and 6 units of 9°N 
Polymerase (exo- ) A485L/Y409V in a total volume of 20 

30 ill . The reaction consisted of 20 cycles at 94 °C for 
20 sec, 64 °C for 40 sec, and 72 °C for 90 sec, which 
yielded DNA extension product 43. DNA extension product 

mixture was f reeze-dried, resuspended in 200 jj.1 of 
deionized water, and irradiated at 355 nm for 10 sec to 
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cleave the fluorophore from the DNA to yield DNA 
product 44 and then analyzed by MALDI-TOF MS. Finally, 
deallylation of the photocleavage product was performed 
using a Pd-catalyzed deallylation reaction resulting in 
5 a deallylated DNA product 45, which was then analyzed 
by MALDI-TOF MS, 

4) Primer Extension with 3' -0-Allyl-dUTP-PC-R6G (33), 
followed, by Photocleavage and Deallylation of the 

10 Extension Product. The polymerase extension reaction 
mixture consisted of 60 pmol of primer (5'- 
GATAGGACTCATCACCA-3' ) (SEQ ID N0:11), 80 pitiol of 100- 
mer template (5'- 

GAAGGAGACACGCGGCCAGAGAGGGTCCTGTCCGTGTTTGT GCG 

15 TGGAGTTCGACAAGGCAGGGTCATCTAATGGTGATGAGTCCTATCCTTT 

TCTCTTCGTTCTCCGT-3' ) (SEQ ID NO: 12), 12 0 pmol of 3'-0- 
allyl-dUTP-PC-R6G, 120 pmol of 3' -O-allyl-dATP-PC-ROX, 
120 pmol of 3' -O-allyl-dGTP-PC-Bodipy-FL-510, 120 pmol 
of 3' -O-allyl-dCTP-PC-Bodipy-650, IX Thermopol reaction 

20 buffer (20 mM Tris-HCl 

/10 mM (NH 4 ) 2 SO 4 /10 mM KC1/2 mM MgSO 4 /0.1 % Triton X- 
100, pH 8.8, New England Biolabs), and 6 units of 9°N 
Polymerase (exo- ) A485L/Y409V in a total volume of 20 

jj.1 . The reaction consisted of 20 cycles at 94 °C for 
25 20 sec, 46 °C for 40 sec, and 60 °C for 90 sec, which 
yielded DNA extension product 46. DNA extension product 

mixture was f reeze-dried, resuspended in 200 p.1 of 
deionized water, and irradiated at 355 nm for 10 sec to 
cleave the fluorophore from the DNA to yield DNA 
30 product 47 and then analyzed by MALDI-TOF MS. Finally, 
deallylation of the photocleavage product was performed 
using a Pd-catalyzed deallylation reaction resulting in 
a deallylated DNA product 48, which was then analyzed 
by MALDI-TOF MS. 
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Example 3 



Four-Color DNA Sequencing by Synthesis on a Chip Using 
5 Photocleavable Fluorescent Nucleotides 

Synopsis 

In this example, 4-color DNA sequencing by synthesis 
(SBS) on a chip using four photocleavable fluorescent 

10 nucleotide analogues (dGTP-PC-Bodipy-FL-510, dUTP-PC- 
R6G, dATP-PC-ROX, and * dCTP-PC-Bodipy-650) is 
demonstrated. Each nucleotide analogue consists of a 
different fluorophore attached to the 5-position of the 
pyrimidines (C and U) and the 7-position of the purines 

15 (G and A) through a photocleavable 2-nitrobenzyl 
linker. After verifying that these nucleotides could be 
successfully incorporated into a growing DNA strand in 
a solution-phase polymerase reaction and the 
fluorophore could be cleaved using laser irradiation 

20 (X~355 nm) in 10 seconds, an SBS reaction was then 
performed on a chip which contains a self-priming DNA 
template covalently immobilized using 1,3-dipolar 
azide-alkyne cycloaddition . The DNA template was 
produced by a polymerase chain reaction using an azido- 

2 5 labeled primer and the self -priming moiety was attached 
to the immobilized DNA template by enzymatic ligation. 
Each cycle of SBS consists of the incorporation of the 
photocleavable fluorescent nucleotide into the DNA, 
detection of the fluorescent signal and photocleavage 

30 of the. fluorophore. The entire process was repeated to 
identify 12 continuous bases in the DNA template. These 
results demonstrate that photocleavable fluorescent 
nucleotide analogues can be incorporated accurately 
into a growing- DNA strand during a polymerase reaction 
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in solution phase as well as on a chip. Moreover, all 4 
f luorophores can be detected and then efficiently 

mg near-UV irradiation, thereby allowing 
continuous identification of the DNA template sequence. 
5 Optimization of the steps involved increases the 
readlength . 



Results 



20 



10 DNA sequencing is a fundamental tool for biological 
science. The completion of the Human Genome Project has 
set the stage for screening genetic mutations to 
identify disease genes on a genome-wide scale (42) . 
Accurate high-throughput DNA sequencing methods are 
15 needed to explore the complete human genome sequence 
for applications in clinical medicine and health care. 
Recent studies have indicated that an important route 
for identifying functional elements in the human genome 
involves sequencing the genomes of many species 
representing a wide sampling of the evolutionary tree 
(43) . To overcome the limitations of the current 
electrophoresis-based sequencing technology (44-47), a 
variety of new DNA-sequencing methods have been 
investigated. Such approaches include sequencing by 
25 hybridization (48), mass spectrometry based sequencing 
(49-51), sequence-specific detection of single-stranded 
DNA using engineered nanopores (52) . More recently, DNA 
sequencing by synthesis (SBS) approaches such as 
pyrosequencing (53), sequencing of single DNA molecules 
30 (54) and polymerase colonies (55) have been widely 
explored . 



The concept of DNA sequencing by synthesis was revealed 
in 1988 (56) . This approach involves detection of the 
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identity of each nucleotide immediately after its 
incorporation into a growing strand of DNA in a 
polymerase reaction. Thus far, no complete success has 
been reported in using such a system to sequence DNA 
5 unambiguously- An SBS approach was proposed using 
photocleavable fluorescent nucleotide analogues on a 
surface in 2000 (57). In this approach, modified 
nucleotides are used as reversible terminators, in 
which a different fluorophore with a distinct 
10 fluorescent emission is linked to each of the 4 bases 
through a photocleavable linker and the 3 1 -OH group is 
capped by a small chemical moiety. DNA polymerase 
incorporates only a single nucleotide analogue 
complementary to the base on a DNA template covalently 
15 linked to a surface. After incorporation, the unique 
fluorescence emission is detected to identify the 
incorporated nucleotide and the fluorophore is 
subsequently removed photochemically . The 3 1 -OH group 
is then chemically regenerated, which allows the next 
2 0 cycle of the polymerase reaction to proceed. Since the 
large surface on a DNA chip can have a high density of 
different DNA templates spotted, each cycle can 
identify many bases in parallel, allowing the 
simultaneous sequencing of a large number of DNA 
25 molecules. The advantage of using photons as reagents 
for initiating photoreactions to cleave the fluorophore 
is that no additional chemical reagents are required to 
be introduced into the system and clean products can be 
generated with no need for subsequent purification. It 
30 has previously been established the feasibility of 
performing SBS on a chip using a synthetic DNA template 
and photocleavable pyrimidine nucleotides (C and U) 
(58) . As further development of this approach, here the 
design and synthesis of 4 photocleavable nucleotide 
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analogues (A, C, G, U) is reported, each of which 
contains a unique fluorophore with a distinct 
fluorescence emission. Initially, it is established 
that these nucleotides are good substrates for DNA 
5 polymerase in a solution-phase DNA extension reaction 
and that the fluorophore can be removed with high speed 
and efficiency by laser irradiation (D~355 nm) . 
Subseqencetly, SBS was performed using these 4 
photocleavable nucleotide analogues to identify the 

10 sequence of a DNA template immobilized on a chip. The 
DNA template was produced by PCR using an azido-labeled 
primer, and was immobilized on the surface of the chip 
with 1,3-dipolar azide-alkyne cycloaddition chemistry. 
A self-priming moiety was then covalently attached to 

15 the DNA template by enzymatic ligation to allow the 
polymerase reaction to proceed on the DNA immobilized 
on the surface . 



20 



Materials and Methods 



All chemicals were purchased from Sigma-Aldrich unless 
otherwise indicated. 1H NMR spectra were recorded on a 
Bruker 400 spectrometer. High-resolution MS (HRMS) data 
were obtained by using a JEOL JMS HX 110A mass 

2 5 spectrometer. Mass measurement of DNA was made on a 
Voyager DE matrix-assisted laser desorption ionization- 
time-of- flight (MALDI-TOF) mass spectrometer (Applied 
Biosystems) . Photolysis was performed using a Spectra 
Physics GCR-150-30 Nd-YAG laser that generates light 

30 pulses at 355 nm (ca. 50 mJ/pulse, pulse length ca. 7 
ns) at a frequency of 3 0 Hz with a light intensity at 
ca. 1.5 W/cm2. The scanned fluorescence emission images 
were obtained by using a ScanArray Express scanner 
(Per kin-Elmer Life Sciences) equipped with four lasers 
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with excitation wavelengths of 488, 543, 594, and 633 

nm and emission filters centered at 522, 570, 614, and 
670 nm. 



5 Synthesis of photocleavable fluorescent nucleotides . 

Photocleavable fluorescent nucleotides dGTP-PC-Bodipy- 
FL-510, dUTP-PC-R6G, dATP-PC-ROX and dCTP-PC-Bodipy-650 
(Fig. 16) were synthesized according to Fig. 17 using a 
similar method as reported previously (57) . A 

10 photocleavable linker (PC-Linker) 1- [ 5- (aminomethyl ) -2- 
nitrophenyl] ethanol was reacted with the NHS ester of 
the corresponding fluorescent dye to produce an 
intermediate PC- Dye, which was converted to a PC-Dye 
NHS ester by reacting with N, N' -disuccinimidyl 

15 carbonate. The coupling reaction between the different 
PC-Dye NHS esters and the amino nucleotides (dATP-NH2 
and dGTP-NH2 from Perkin-Elmer ; dUTP-NH2 from Sigma; 
dCTP-NH2 from TriLink BioTechnologies ) produced the 4 
photocleavable fluorescent nucleotides . 

20 

DNA polymerase reaction using 4 photocleavable 
fluorescent nucleotide analogues in solution. 
Four nucleotide analogues were characterized, dGTP-PC- 
Bodipy-FL-510, dUTP-PC-R6G, dATP-PC-ROX and dCTP-PC- 

25 Bodipy-650 by performing four continuous DNA-extension 

t 

reactions sequentially using a primer (5'- 
AGAGGATCCAACCGAGAC-3' ) (SEQ ID NO: 13) and a synthetic 
DNA template (5'- 

GTGTACATCAACATCACCTACCACCATGTCAGTCTCGGTTGGAT- 
30 CCTCTATTGTGTCCGG-3' ) (SEQ ID NO:14) corresponding to a 
portion of exon 7 of the human p53 gene (Fig. 18) . The 
four nucleotides in the template immediately adjacent 
to the annealing site of the primer were 3'-ACTG-5'. 
First, a polymerase extension reaction using dUTP-PC- 
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R6G along with the primer and the template was 
performed producing a single base extension product. 
The reaction mixture for this, and all subsequent 
extension reactions, consisted of 8 0 pmol of template, 
5 50 pmol of primer, 80 pmol of the particular 
photocleavable fluorescent nucleotide, IX Thermo 
Sequenase reaction buffer, and 4 U of Thermo Sequenase 
DNA polymerase (Amersham Biosciences) in a total volume 
of 20 JJ.L . The reaction consisted of 25 cycles at 94 °C 

10 for 20 sec, 48 °C for 40 sec, and 60 °C for 75 sec. 
Subsequently, the extension product was purified by 
using reverse-phase HPLC . An Xterra MS C18 (4.6 x 50- 
mm) column (Waters) was used for the HPLC purification. 
Elution was performed over 120 minutes at a flow rate 

15 of 0.5 mL/min with the temperature set at 50 oC by 
using a linear gradient (12-34.5%) of methanol in a 
buffer consisting of 8.6 mM triethylamine and 100 mM 
hexafluoroisopropyl alcohol (pH 8.1). The fraction 
containing the desired DNA product was collected and 

20 freeze-dried for analysis using MALDI-TOF mass 
spectrometry. For photocleavage, the purified DNA 
extension product bearing the fluorescent nucleotide 
analogue was resuspended in 200 fxL of deionized water. 
The mixture was irradiated for 10 seconds in a quartz 

25 cell with path lengths of 1.0 cm employing a Nd-YAG 
laser at 355 nm and then analyzed by MALDI-TOF 4 MS. 
After photocleavage, the DNA product with the 
fluorophore removed was used as a primer for a second 
extension reaction using dGTP-PC-Bodipy-FL-510 . The 

30 second extended product was then purified by HPLC and 
photolyzed. The third extension using dATP-PC-ROX and 
the fourth extension using dCTP-PC-Bodipy-650 were 
carried out in a similar manner using the previously 
extended and photocleaved product as the primer. 
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PCR amplification to produce azido-labeled DNA 
template . 

An azido-labeled PCR product was obtained using a 100- 
5 bp template (5'- 

AGCGACTGCTATCATGTCATATCGACGTGCTCACTAGCTCTACATATGCGTGCGT 
GATCAGATGACGTATCGATGCTGACTATAGTCTCCCATGCGAGTG-3' ) ( SEQ 
ID NO: 15), a 24-bp azido-labeled forward primer <5'-N3- 
AGCGACTGCTATCATGTCATATCG-3' ) (SEQ ID NO:16), and a 24- 
10 bp unlabeled reverse primer (5'- 

CACTCGCATGGGAGACTATAGTCA-3' ) (SEQ ID NO: 17). In a total 

reaction volume of 50 j-iL , 1 pmol of template and 30 
pmol of forward and reverse primers were mixed with 1 U 
of AccuPrime Pfx DNA polymerase and 5 fxL of 10X 

15 AccuPrime Pfx reaction mix (Invitrogen) containing 1 mM 
of MgS0 4 and 0.3 mM of dNTP. The PCR reaction consisted 
of an initial denaturation step at 95 °C for 1 min, 
followed by 38 cycles at 94 °C for 15 sec, 63 °C for 30 
sec, 68 °C for 30 sec. The product was purified using a 

2 0 9 6 QIAquick multiwell PCR purification kit (Qiagen) and 
the quality was checked using 2% agarose gel 
electrophoresis in IX TAE buffer. The concentratibn of 
the purified PCR product was measured using a Perkin- 
Elmer Lambda 4 0 UV-Vis spectrophotometer. 

25 

Construction of a self-priming DNA template on a chip 
by enzymatic ligation . 

The amino-modif ied glass slide (Sigma) was 
f unctionalized to contain a terminal alkynyl group as 
30 described previously (57) . The azido-labeled DNA 
product generated by PCR was dissolved in DMSO/H2O 

(1/3, v/v) to obtain a 20 |iM sblution. 5 ixL of the DNA 
solution was mixed with Cul (10 nmol, 100 eq. ) and N,N- 
diisopropyl-ethylamine (DIPEA) (10 nmol, 100 eq.) and 
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then spotted onto the alkynyl -modified glass surface in 
the form of 6 \xL drops. The glass slide was incubated 
in a humid chamber at room temperature for 24 hr, 
washed with deionized water (dH 2 0) and SPSC buffer (50 
5 mM sodium phosphate, 1 M NaCl, pH 6.5) for 1 hr (57), 
and finally rinsed with dH 2 0. To denature the double 
stranded PCR- amplified DNA to remove the non-azido- 
labeled strand, the glass slide was immersed into 0.1 M 
NaOH solution for 10 min and then washed with 0.1 M 

10 NaOH and dH 2 0, producing a single stranded DNA template 
that is immobilized on the chip. For the enzymatic 
ligation of a self-priming moiety to the immobilized 
DNA template on the chip, a 5 ' -phosphorylated 40-bp 
loop primer (5'-P03- 

15 GCTGAATTCCGCGTTCGCGGAATTCAGCCACTCGCATGGG-3' ) (SEQ ID 
NO: 18) was synthesized. This primer contained a 
thermally stable loop sequence 3' -G (CTTG) C-5' , a 12-bp 
stem, and a 12-bp overhanging end that would be 
annealed to the immobilized single stranded template at 

20 its 3' -end. A 10 p,L solution consisting of 100 pmol of 
the primer, 10 U of Taq DNA ligase, 0 . 1 mM NAD, and IX 
reaction buffer (New England Biolabs) was spotted onto 
• a location of the chip containing the immobilized DNA 
and incubated at 45 °C for 4 hr. The glass slide was 

2 5 washed with dH 2 0, SPSC buffer, and again with dH 2 0. The 
formation of a stable hairpin was ascertained by 
covering the entire surface with IX reaction buffer (2 6 
mM Tris~HCl/6.5 mM MgCl 2 , pH 9.3), incubating it in a 
humid chamber at 94 °C for 5 min to dissociate any 

30 partial hairpin structure, and then slowly cooling down 
to room temperature, for reannealing. 

SBS reaction on a chip with four photocleavable 
fluorescent nucleotide analogues. 
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One microliter of a solution consisting of dATP-PC-ROX 
(60 pmol) , 2 U of Thermo Sequenase DNA polymerase, and 
IX reaction buffer was spotted on the surface of the 
chip, where the self-primed DNA moiety was immobilized. 
5 The nucleotide analogue was allowed to incorporate into 
the primer at 72 °C for 5 min. After washing with a 
mixture of SPSC buffer, 0.1% SDS, and 0.1% Tween 20 for 
10 min, the surface was rinsed with dH 2 0 and ethanol 
successively, and then scanned with a ScanArray Express 

10 scanner to detect the fluorescence signal. To perform 
photocleavage, the glass chip was placed inside a 
chamber (50 x 50 x 50 mm) filled with 
acetonitrile/water (1/1, v/v) solution and irradiated 
for 1 min with the Nd-YAG laser at 355 nm. The light 

15 intensity applied on the glass surface was ca . 1.5 
W/cm 2 . After washing the surface with dH 2 0 and ethanol, 
the surface was scanned again to compare the intensity 
of fluorescence after photocleavage with the original 
fluorescence intensity. This process was followed by 

20 the incorporation of dGTP-PC-Bodipy-FL-510 , with the 
subsequent washing, fluorescence detection, and 
photocleavage processes performed as described above. 
The same cycle was repeated 10 more times using each of 
the four photocleavable fluorescent nucleotide 

2 5 analogues complementary to the base on the template. 

For a negative control experiment, 1 |iL solution 
containing dATP-PC-ROX (60 pmol), and IX reaction 
buffer was added on to the DNA immobilized on the chip 
in the absence of DNA polymerase and then incubated at 

3 0 72 °C for 5 min, followed by the same washing and 

detection steps as above. 
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To demonstrate the feasibility of carrying out DNA 
sequencing by synthesis on a chip, four photocleavable 
fluorescent nucleotide analogues (dGTP-PC-Bodipy-FL- 
510, dUTP-PC-R6G, dATP-PC-ROX, and dCTP-PC-Bodipy-650 ) 
5 (Fig. 16) were synthesized according to Fig. 17 using a 
similar procedure as reported previously (57) . Modified 
DNA polymerases have been shown to be highly tolerant 
to nucleotide modifications with bulky groups at the 5- 
position of pyrimidines (C and U) and the 7-position of 
10 purines (A and G) (59, 60) . Thus, each unique 
fluorophore was attached to the 5 position of C/U and 
the 7 position of A/G through a photocleavable 2- 
nitrobenzyl linker . 

15 In - order to verify that these fluorescent nucleotides 
are incorporated accurately in a base-specific manner 
in a polymerase reaction, four continuous steps of DNA 
extension and photocleavage by near UV irradiation were 
carried out in solution as shown in Fig. 18. This 

2 0 allows the isolation of the DNA product at each step 
for detailed molecular structure characterization as 
shown in Fig. 19. The first extension product 5' — U(PC- 
R6G)-3' 1 was purified by HPLC and analyzed using 
MALDI-TOF MS [Fig. 19(1)]. This product was then 

25 irradiated at 355 nm using an Nd-YAG laser for 10 
seconds and the photocleavage product 2 was also 
analyzed using MALDI-TOF MS [Fig. 19(2)]. Near UV light 
absorption by the aromatic 2-nitrbbenzyl linker causes 
reduction of the 2-nitro group to a nitroso group and 

30 an oxygen insertion into the carbon-hydrogen bond 
followed by cleavage and decarboxylation (61). As can 
be seen from Fig. 19(1), the MALDI-TOF MS spectrum 
consists of a distinct peak at m/z 6536 corresponding 
to the DNA extension product 5' — U(PC-R6G)-3' (1) 
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which confirms that the nucleotide analogue can be 
incorporated base specifically by DNA polymerase into a 
growing DNA strand. The small peak at m/z 5872 
corresponding to the photocleavage product is due to 
5 the partial cleavage caused by the nitrogen laser pulse 
(337 ran) used in MALDI ionization. For photocleavage, 
a Nd-YAG laser was used to irradiate the DNA product 
carrying the fluorescent nucleotide for 10 seconds at 
355 run to cleave the fluorophore from the DNA extension 

10 product. Fig. 19(2) shows the photocleavage result of 
the above DNA product. The peak at m/z 6536 has 
completely disappeared while the peak corresponding to 
the photocleavage product 5' — U (2) appears as the sole 
dominant peak at m/z 5872, which establishes that laser 

15 irradiation completely cleaves the fluorophore with 
high speed and efficiency without damaging the DNA. The 
next extension reaction was carried out using this 
photocleaved DNA product as a primer along with dGTP- 
PC-Bodipy-FL-510 to yield an extension product 5' — 

20 UG (PC-Bodipy-FL-510) -3' (3). As described above, the 

i 

extension product 3 was purified, analyzed by MALDI— TOF 
MS producing a dominant peak at m/z 6751 [Fig. 19(3)], 
and then photocleaved for further MS analysis yielding 
a single peak at m/z 6255 (product 4) [Fig. 19(4)]. 

25 The third extension using dATP-PC-ROX to yield 5' — 
UGA(PC-ROX) -3' (5), the fourth extension using dCTP-PC- 
Bodipy-650 to yield 5' — UGAC (PC-Bodipy-650) -3' (7) and 
their photocleavage to yield products 6 and 8 were 
similarly carried out and analyzed by MALDI-TOF MS as 

30 shown in Figs. 19(5), 19(6), 19(7) and 19(8). These 
results demonstrate that the above-synthesized four 
photocleavable fluorescent nucleotide analogues can 
successfully incorporate into the growing DNA strand in 
a polymerase reaction, and the fluorophore can be 
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efficiently cleaved by near UV irradiation, which makes 
it feasible to use them for SBS on a chip. 

The photocleavable fluorescent nucleotide analogues 
5 were then used in an SBS reaction to identify the 
sequence of the DNA template immobilized on a solid 
surface as shown in Fig. 20. A site-specific 1,3- 
dipolar cycloaddition coupling chemistry was used to 
covalently immobilize the azido-labeled double-stranded 

10 PCR products on the alkynyl-f unctionalized surface in 
the presence of a Cu(I) catalyst. Previously, it has 
shown have shown that DNA is successfully immobilized 
on the glass surface by this chemistry and evaluated 
the functionality of the surface-bound DNA and the 

15 stability of the array using a primer extension 
reaction (57) . The surface-immobilized double stranded 
PCR product was denatured using a 0.1 M NaOH solution 
to remove the complementary strand without the azido 
group, thereby generating a single-stranded PCR 

2 0 template on the surface. Then, a 5' -phosphorylated 
self -priming moiety (loop primer) was ligated to the 
3' -end of the above single stranded DNA template using 
Taq DNA ligase (21) . The structure of the loop primer 
was designed to bear a thermally stable loop (22) and 

2 5 stem sequence with a melting temperature of 8 9 °C . The 
12-bp overhanging portion of the loop primer was made 
complementary to the 12-bp sequence of the template at 
its 3' end to allow the Taq DNA ligase to seal the nick 
between the 5' -phosphate group of the loop primer and 

30 the 3'-hydroxyl group of the single-stranded DNA 
template. This produces a unique DNA moiety that can 
self -prime for the synthesis of a complementary strand. 
The ligation was found to be in quantitative yield in a 
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parallel solution-phase reaction using the same primer 
and single-stranded DNA template. 

The principal advantage offered by the use of a self- 
5 priming moiety as compared to using separate primers 
and templates is that the covalent linkage of the 
primer to the template in the self-priming moiety 
prevents any possible dissociation of the primer from 
the template under vigorous washing conditions . 

10( Furthermore, the possibility of mispriming is 
considerably reduced and a universal loop primer can be 
used for all the templates allowing enhanced accuracy 
and ease of operation. SBS was performed on the chip- 
immobilized DNA template using the 4 photocleavable 

15 fluorescent nucleotide analogues and the results are 
shown in Fig. 21. The structure of the self-priming DNA 
moiety is shown schematically in the upper panel, with 
the first 12 nucleotide sequence immediately after the 
priming site. The sequencing reaction on the chip was 

2 0 initiated by extending the self-priming DNA using dATP- 
PC-ROX (complementary to the T on the template) , and 
Thermo Sequenase DNA polymerase. After washing, the 
extension of the primer by a single fluorescent 
nucleotide was confirmed by observing an orange signal 

25 (the emission signal from ROX) in a microarray scanner 
[Fig. 21(1)]. After detection of the fluorescent 
signal, the surface was irradiated at 355 run for 1 min 

* 

using an Nd-YAG laser to cleave the f luorophore . The 
surface was then washed, and a negligible residual 
30 fluorescent signal was detected to confirm complete 
photocleavage of the fluorophore [Fig. 21(2)]. This was 
followed by incorporation of the next fluorescent 
nucleotide complementary to the subsequent base on the 
template. The entire process of incorporation, 
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detection and photocleavage was performed multiple 
times using the four photocleavable fluorescent 
nucleotide analogues to identify 12 successive bases in 
the DNA template. The integrated fluorescence intensity 
5 on the spot, obtained from the scanner software, 
indicated that the incorporation efficiency was over 
90% and more than 97% of the original fluorescence 
signal was removed by photocleavage. A negative control 

experiment consisting of incubating the self-priming 
10 DNA moiety with dATP-PC-ROX in the absence of DNA 

polymerase and washing the surface showed that 

negligible fluorescence remained as compared to that of 

Fig. 21 (1) . 

15 In summary, synthesis and characterization of four 
photocleavable fluorescent nucleotide analogues are 
disclosed here, and their use to produce 4-color DNA 
sequencing data on a chip. These nucleotides have been 
shown to be excellent substrates for the DNA polymerase 
and the fluorophore could be cleaved efficiently using 
near UV irradiation. This is important with respect to 
enhancing the speed of each cycle in SBS for high 
throughput DNA analysis. It has also been demonstrated 
that a PCR-amplified DNA template can be ligated with a 
self-priming moiety and its sequence can be accurately 
identified in a DNA polymerase reaction on a chip, 
indicating that a PCR product from any organism can be 
potentially used as a template for the SBS system in 
the future. The modification of the 3' -OH of the 
photocleavable fluorescent nucleotide ' with a small 
chemical group to allow reversible termination is 
reported in (58) . The library of photocleavable 
fluorescent nucleotides reported here should also 
facilitate the development of single molecule DNA 
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sequencing approaches. Thus, by further improving the 
readlength and incorporation efficiency, this approach 
potentially can be developed into a high-throughput 
DNA-analysis system for biological research and medical 
5 applications . 



I 1 

WO 2007/053702 PCT/US2006/042698 

76 



I. Lander, E.S., Linton, L.M., Birren, B., Nusbaum, 
C, Zody, M.C., Baldwin, J., Devon, K., Dewar, K-, 

5 Doyle, M. , FitzHugh, W. et al. (2001) Nature 409, 

860 -921. 

2- Venter, J.C., Adams, M.D., Myers, E.W., Li, P.W., 
Mural, R.J., Sutton, G.G., Smith, H.O., Yandell, 
M . , Evans, C.A., Holt, R.A. et al. (2001) Science 
10 291, 1304 -1351. 

3. - Roses, A. D. (2000) Nature 405, 857 - 865. 

4. Collins, F. S., Green, E . D., Guttmacher, A. E . & 
Guyer, M. S. (2003) Nature 422, 835 - 847. 

5. Friedman, L. S., Ostermeyer, E. A., Szabo, C. I., 
15 Dowd, P., Lynch, E. D. , Rowell, S. E., & King, M . - 

C. (1994) Nature Genetics 8, 399 - 404. 

6. Stickney, H. L., Schmutz, J., Woods, I. G., 
Holtzer, C. C, Dickson, M. C, Kelly, P. D. , 
Myers, R. M. & Talbot, W. S. (2 002) Genome Res. 

20 12, 1929 - 1934. 

7. Smith, L. M. , Sanders, J. Z., Kaiser, R. J., 
Hughes, P., Dodd, C, Connell, C. R. , Heiner, C, 
Kent, S. B. H. & Hood, L. E. (1986) Nature 321, 
674 - 679. 

25 8. Ju, J., Ruan, C, Fuller, C. W., Glazer, A. N. & 

Mathies, R. A. (1995) Proc. Natl. Acad. Sci. USA 

92, 4347 - 4351. 
9. Ju, J., Glazer, A. N. & Mathies, R. A. (1996) 

Nucleic Acids Res. 24, 1144 - 1148. 
30 10. Kan, C. W., Fredlake, C. P, Doherty, E. A. S. & 

Barron, A. E. (2004) Electrophoresis, 25, 3564- 

3588. 

II. Kheterpal, I., Scherer, J., Clark, S. M., 
Radhakrishnan, A., Ju, J., . Ginther, C. L., 



(' 

WO 2007/053702 



77 



PCT7US2006/042698 



Sensabaugh, G . P. & Mathies, R. A. (1996) 
Electrophoresis 17, 1852 - 1859. 
12. Ju, J., Li, Z., Edwards, J. & Itagaki, Y. (2003) 
U. S. Patent 6,664,079. 
5 13. Li, Z., Bai, X., Ruparel, H., Kim, S., Turro, N. 

J. & Ju, J. (2003) Proc. Natl. Acad. Sci. USA 100, 

414-419. 

14. Seo, T. S., Bai, X., Ruparel, H., Li, Z., Turro, 
N . J. & Ju, J. (2004) Proc. Natl. Acad. Sci. USA 

10 101, 5488 - 5493. 

15. Seo, T. S., Bai, X., Kim D. H., Meng, Q. , Shi, S., 
Ruparel, H., Li, Z., Turro, N. J. & Ju, J. (2005) 
" Four-Color DNA Sequencing by Synthesis on a Chip 
Using Photocleavable Nucleotides'", submitted to 

15 Proc. Natl. Acad. Sci. USA. 

16. Pelletier, H., Sawaya, M. R., Kumar, A., Wilson, 
S. H. & Kraut, J. (1994) Science 264, 1891 - 1903. 

17. Axelrod, V. D., Vartikyan, R. M . , Aivazashvili, 
V. A. & Beabealashvili, R. S. (1978) Nucleic Acids 

20 Res. 5, 3549 - 3563. 

18. Metzker, M. L., Raghavachari , R., Richards, S., 
Jacutin, S. E . , Civitello, A., Burgess, K. & 
Gibbs, R . A. (1994) Nucleic Acids Res. 22, 4259 - 
4267. 

25 19. Beabealashvili R. S., Scamrov, A. V., Kutateladze, 

T. V., Mazo, A. M . , Krayevsky, A. A. & Kukhanova 
M. K. (198 6) Biochim. Biophys. Acta 868, 136 - 
144 . 

20. Kutateladze, T. V., Kritzyn, A. M . , Florentjev, V. 

■ 

30 L., Kavsan, V. M . , Chidgeavadze , Z. G. & 

Beabealashvili, R. S. (1986) FEES 201, 205 - 212. 

21. Chidgeavadze, Z. G. & Beabealashvili, R. S. (1984) 
Nucleic Acids Res. 12, 1671 - 1686. 



WO 2007/053702 PCT/US2006/042698 

78 

22. Canard, B., Cardona, B. & Sarfati, R- S. (1995) 
Proc. Natl. Acad. Sci. USA 21, 10859 - 10863. 

23. Guibe, F. (1998) Tetrahedron 54, 2967-3042. 

24. Sabitha, G . , Sbabu, R., Rajkumar, M., Srividya, R. 
5 & Yadav. J. S. (2001) Org. Lett. 3, 1149-1151. 

25. Edwards, J. R., Itagaki, Y. & Ju, J. (2001) 
Nucleic Acids Res. 29, el04. 

26. Karakawa, M . , Kamitakahara, H., Takano, T. & 
Nakatsubo. F. (2002) Biomacromolecules 3, 538-546. 

10 27 . Honda, M. , Morita, H. & Nagakura, I. (1997) J . 

Org. Chem. 62, 8932-8936. 

28. Lacroix, T . , Bricout, H . , Tilloy, S. & Monflier, 
E. (1999) Eur. J. Org. Chem. 11, 3127-3129. 

29. Lemaire, S., Savignac, M . , Blart, E. & Bernard, J. 
15 M. (1997) Tetrahedron Lett 38, 2955-2958. 

30. Genet, J. P., Blart, E. & Savignac, M. (1994) 
Tetrahedron 50(2), 497-503. 

31. Milton, J., Wu, X., Smith, M. , Brennan, J., 
Barnes, C, Liu, X. & Ruediger, S. (2004) PCT 

20 Intl. Patent. Appl . WO 0418497. 

32. DeVasher, R. B., Moore, L. R. & Shaughnessy, K. H. 
(2004) J. Org. Chem. 69, 7919-7927. 

33. Ju, J./ Li, Z.; 'Edwards, J.; Itagaki, Y. Massive 
parallel method for decoding DNA and RNA. U.S. 

25 Patent 2003, 6,664,079. 

34. (a) Ruparel, H. ; Bi, L.; Li, Z.; Bai, X.; Kim, D. 
H.; Turro, N. J.; Ju, J. Design and synthesis of a 
3'-0-allyl photocleavable fluorescent nucleotide 
as a reversible terminator for DNA sequencing by 

30 synthesis. Proc. Natl. Acad. Sci. USA 2005, 102, 

5932-5937. (b) Seo, T. S.; Bai, X.; Kim, D. H.; 
Meng, Q.; Shi, S.; Ruparel, H.; Li, Z.; Turro, N. 
J.; Ju, J. Four-color DNA sequencing by synthesis 
on a chip using photocleavable fluorescent 



WO 2007/053702 PCT7US2006/042698 

79 

nucleotides. Proc. Natl. Acad. Sci. USA 2005, 102, 
5926-5931. 

35. See the Material and Methods section for 
experimental procedures and characterization data 

5 of the compounds synthesized. 

36. Seela, F . ; Driller, H. 7-deaza-2 ' -deoxy-06- 
methylguanosine : selective N2-f ormylation via a 
f ormamidine, phosphoramidite synthesis and 
properties of oligonucleotides. Nucl. Nucl. 1989, 

10 8, 1-21. 

37. Ramzaeva, N . ; Seela, F. 7-sbustitued 7-deaza-2'- 
deoxyguanosines : regioselective halogenation of 
pyrrolo[2, 3-d] pyrimidine nucleoside. He! v. Chlm. 
Acta. 1995, 78, 1083-1090. 

15 38. Ryu, E. K.; Ross, R. J.; Matsushita, T . ; MacCoss, 

M . ; Hong, C. I.; West, C. R. Phospholipid- 
nucleoside conjugates. 3. syntheses and 
preliminary biological evaluation of 1-p-D- 
arabinof uranosylcytosine 5 ' -monophophate-L-1 , 2- 

2 0 dipalmitin and selected 1-p-D- 

arabinof uranosylcytosine 5' -diphsphate-L-1, 2- 

diacylglycerosls . J. Med. Chem. 1982, 25, 1322- 
1329. 

39. (a) Robins, M. J.; Vinayak, R. S.; Wood, S. G. 
2 5 Solvent, not palladium oxidation state, is the 

primary determinant for successful coupling of 
terminal alkynes with iodo-nucleosides . 

Tetrahedron Lett. 1990, 31, 3731-3734. (b) Hobbs, 
F. W. Palladium-catalyzed synthesis of 

30 alkynylamino nucleosides. A universal linker to 

nucleic acids. J. Org. Chem. 1989, 54, 3420-3422. 

40. (a) Ramasamy, K. ; Imarura, N.; Robins, R. K.; 
Revankar, G. R. a facile and improved tubercidin 
and certain related pyrrolo [2 , 3-d] pyrimidine 



WO 2007/053702 PCT/US2006/042698 

80 

nucleosides by the sterospecif ic sodium salt 
glycosylation procedure [1] . J". Heterocyclic Chem. 
1988, 25, 1893-1898. (b) Ramasamy, K. ; Imarura, 
N . ; Robins , R. K. ; Revankar, G. R. a facile 
5 synthesis of ubercidin and related 7-deazapurine 

nucleosides via the sterrospecif ic sodium salt 
cgycosylation procedure. Tetrahedron Lett. 1987, 
28, 5107-5110. 

41. (a) Lee, S. E.; Sidorov, A.; Gourlain, T.; Mignet, 
10 N.; Thorpe, S. J.; Brazier, J. A.; Dickman, M. J. ; 

Hornby, D. P.; Grasby, J. A.; Williams, D. M. 
enhancing the catalytic repertoire of nucleic 
acids: a systematic study of linker length and 
rigidity. Nucleic Acids Res. 2001, 29, 1565-1573. 

15 (b) Giller, G.; Tasara, T.; Angerer, B.; 

Muhlegger, K; Amacker, M . ; Winter, H. 
Incorporation of reporter molecule-labeled 
nucleotides by DNA polymerase. I. Chemical 
synthesis of variousl reporter group-labeled 2'- 

20 deoxyribonucleoside-5' -triphosphate . Nucleic Acids 

Res. 2003, 31, 2630-2635. 

42. Collins, F. S., Green, E. D. , Guttmacher, A. E. & 

Guyer, M. S. (2003) Nature 422, 835-847. 

43. Thomas, J. W., Touchman, J. W., Blakesley, R. W. , 
25 Bouffard, G. G. , Beckstrom-Sternberg, S. M. , 

* 

* 

Margulies, E. H., Blanchette, M. , Siepel, A. C, 
Thomas, P. J. & McDowell, J. C. et al. (2 003) 

Nature 424, 788-793. 

44. Smith, L. M. , Sanders, J. Z., Kaiser, R. J., 
30 Hughes, P., Dodd, C, Connell, C. R. , Heiner, C, 

Kent, S. B. H. & Hood, L. E. (1987) Nature 321, 

674-679. 



f 1 

WO 2007/053702 PCT7US2006/042698 

81 

45. Ju, J., Ruan, C, Fuller, C. W., Glazer, A. N. & 
Mathies, R. A. (1995) Proc. Natl. Acad. Sci. C7SA 

92, 4347-4351. 

46. Doherty, E.A.S., Kan, C. W. and Barron, A. E. 
5 (2003) Electrophoresis, 24, 4170-4180. 

47. Drmanac, S., Kita, D., Labat, I., Hauser, B., 
Schmidt, C, Burczak, J. D. & Drmanac, R. (1998) 
Nat. Biotechnol. 16, 54-58. 

48. Fu, D. J., Tang, K., Braun, A., Reuter, D., 
10 Darnhof er-Demar, B., Little, D. P., O'Donnell, M. 

J., Cantor, C. R. & Koster, H. (1998) Nat. 

Blotechnol. 16, 381-384. 

49. Roskey, M. T . , Juhasz, P., Smirnov, I. P., Takach, 
E. J., Martin, S. A. & Haff, L. A. (1996) Proc. 

15 Natl. Acad. Sci. USA 93, 4724-4729. 

50. Edwards, J. R. , Itagaki, Y . & Ju, J. (2001) 
Nucleic Acids Res. 29, el04 (pl-6) . 

51. Kasianowicz, J. J., Brandin, E., Branton, D. & 
Deamer, D. W. (1996) Proc. Natl. Acad. Sci. USA 

20 93, 13770-13773. 

52. Ronaghi, M., Uhlen, M. & Nyren, P. (1998) Science 

281, 363-365. 

53. Braslavsky, I., Hebert, B., Kartalov, E. & Quake, 
S. R. (2003) Proc. Natl. Acad. Sci. USA 100, 

25 3960-3964 . 

54. Mitra, R. D., Shendure, J., Olejnik, J., Olejnik, 
E. K. & Church, G. M. (2003) Anal. Biochem. 320, 

55-65. 

55. Hyman, E. D. (1988) Anal. Biochem. 174, 423-436. 
30 56. Ju, J., Li, Z., Edwards, J. & Itagaki, Y. (2003) 

U. S. Patent 6,664,079. 



/ ( 

1 



WO 2007/053702 PCT/US2006/042698 

82 

57. Seo, T. S., Bai, X., Ruparel, H., Li, Z., Turro, 
N. J. & Ju, J. (2004) Proc. Natl. Acad. Sci. USA, 

101, 5488-5493. 

58. Ruparel, H., Bi, L., Li, Z., Bai, X., Kim, D. H. , 
5 Turro, N. J. & Ju, J. Design and Synthesis of a 

3'-0-Allyl Modified Photocleavable Fluorescent 
Nucleotide as Reversible Terminator for DNA 
Sequencing by Synthesis (2005) submitted to Proc. 
Natl. Acad. Sci. USA. 
10 59. Rosenblum, B. B., Lee, L. G., Spurgeon, S. L . , 

Khan, S. H., Menchen, S. M . , Heiner, C. R. & 
Chen, S. M. (1997) Nucleic Acids Res. 25, 

4500-4504 . 

60. Zhu, Z., Chao, J., Yu, H. & Waggoner, A. S. 
15 (1994) Nucleic Acids Res. 22, 3418-3422 

61. Rajasekharan Pillai, V. N. (1980) Synthesis 1, 

1-2 6. 

62. Barany, F. (1991) Proc. Natl. Acad. Sci. USA 88, 
189-193. 

20 63. Antao, V. P., Lai, S. Y. & Tinoco, I. Jr. (1991) 

Nucleic Acids Res. 19, 5901-5905. 



WO 2007/053702 PCT/US2006/042698 

83 

What is claimed is: 

1. A method for making 3' O-allyl-dGTP-PC-Bodipy-FL- 
510 comprising performing the steps set forth in 
Figure 7 . 

2. A method for making 3' -O-allyl-dATP-PC-ROX 
comprising performing the steps set forth in 
Figure 8 . 

3. A method for making 3' -O-allyl-dCTP-PC-Bodipy-650 
comprising performing the steps set forth in 
Figure 9. 

4. A method for making 3' -0-allyl-dUTP-PC-R6G 
comprising performing the steps set forth in 
Figure 10. 

5. A method for determining the sequence of a DNA 
comprising performing the following steps for each 
residue of the DNA to be sequenced: 

(a) contacting the DNA with a DNA polymerase 

> 

in the presence of (i) a primer and (ii) 
four fluorescent nucleotide analogues 
under conditions permitting the DNA 
polymerase to catalyze DNA synthesis, 
wherein (1) the nucleotide analogues 
consist of an analogue of dGTP, an 
analogue of dCTP, an analogue of dTTP or 
dUTP, and an analogue of dATP, (2) each 
nucleotide analogue comprises (i) a base 
selected from the group consisting of 
adenine, guanine, cytosine, thymine or 
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uracil, and analogues thereof, (ii) a 
deoxyribose, (iii) a fluorophore 

photocleavably attached to the base, and 
(iv) an allyl moiety bound to the 3'- 
oxygen of the deoxyribose, so that a 
nucleotide analogue complementary to the 
residue being sequenced is bound to the 
DNA by the DNA polymerase, and (3) each of 
the four analogues has a predetermined 
fluorescence wavelength which is different 
than the fluorescence wavelengths of the 
other three analogues ; 

(b) removing unbound nucleotide analogues; 

(c) determining the identity of the bound 
nucleotide analogues; and 

(d) following step (c) , except with respect to 
the final DNA residue to be sequenced, (i) 
chemically cleaving from the bound 
nucleotide analogue the allyl moiety bound 
to the 3' -oxygen atom of the deoxyribose 
and (ii) photocleaving the fluorophore 
from the bound nucleotide analogue, 
wherein steps (d) (i) and (d) (ii) can be 
performed concurrently or in any order, 
and step (d) (i) is performed using a Pd 
catalyst at a pH of about 8.8, 

thereby determining the sequence of the DNA . 

♦ 

6. .The method of claim 5, wherein chemically cleaving 
the allyl moiety bound to the 3' -oxygen atom is 
performed using Na2PdCl4. 



7. The method of claim 5, wherein the primer is a 
self -priming moiety. 
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8. The method of claim 5, wherein the DNA is bound to 
a solid substrate. 

9. The method of claim 8, wherein the DNA is bound to 
the solid substrate via 1,3-dipolar azide-alkyne 
cycloaddition chemistry . 

10. The method of claim 8, wherein about 1000 or fewer 
copies of the DNA are bound to the solid 
substrate . 

11. The method of claim 5, wherein the four 
fluorescent nucleotide analogues are 3'-0-allyl- 
dGTP-PC-Bodipy-FL-510, 3' -O-allyl-dATP-PC-ROX, 3' - 
O-allyl-dCTP-PC-Bodipy-65 0 and 3' -0-allyl-dUTF-PC~ 
R6G. 

12. The method of claim 5, wherein the DNA polymerase 
is a 9°N polymerase. 

13. A method for removing an allyl moiety from the 3'- 
oxygen of a nucleotide analogue's deoxyribose 
moiety comprising the step of contacting the 
nucleotide analogue with a Pd catalyst at a pH of 
about 8.8. 

14. The method of claim 13, wherein the Pd catalyst is 
Na 2 PdCl 4 . 
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